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On the basis of its primary sequence and the location 
of its disulfide bonds, we propose a structural model of 
the erythropoietic hormone erythropoietin (Epo) which 
predicts a four a-helical bundle motif, in common with 
other cytokines. In order to test this model, site-di- 
rected mutants were prepared by high level transient 
expression in Cos? cells and analyzed by a radioim- 
muno assay and by bioassays utilizing mouse and hu- 
man Epo-dependent cell lines. Deletions of 5 to 8 resi- 
dues within predicted a-helices resulted in the failure 
of export of the mutant protein from the cell. In con- 
trast, deletions at the NHs terminus (A2-5), the COOH 
terminus (A 163- 166), or in predicted interhelical 
loops (AB: A32-36, A53-57; EC: A78-82; CD: Alll- 
119) resulted in the export of immunologically detect- 
able Epo muteins that were biologically active. The 
mutein A48-62 could be readily detected by radio- 
immunoassay but had markedly decreased biological 
activity. However, replacement of each of these deleted 
residues by serine resulted in Epo muteins with full 
biological activity. Replacement of Cys** and Cys** by 
tyrosine residues also resulted in the export of fully 
active Epo. Therefore, this small disulfide loop is not 
critical to Epo's stability or function. The properties 
of the muteins that we tested are consistent with our 
proposed model of tertiary structure. 



Humoral regulation of red blood cell production was first 
proposed at the beginning of this century (1). Convincing 
physiologic experiments documenting the existence of eryth- 
ropoietin (Epo)* (2-5) were followed by its purification (6) 
and partial structural characterization (7). The molecular 
cloning of this biologically and clinically important cytokine 
(8, 9) has led to further understanding of its properties (10, 
11). 

The binding of Epo to its cognate receptor (12) on erythroid 
progenitors in the bone marrow results in salvaging these cells 
from apoptosis (13). allowing them to proliferate and differ- 
entiate into circulating erythrocytes. The Epo receptor is a 
member of an ever enlarging family of cytokine receptors (14). 
In like manner, Epo shares weak sequence homology with 

* Thie work was supported by National Institutes of Health Grants 
R01.HL42949 (to H. F. B.) and ROl-GM 39900 (to F. E. 0) and by 
a grant from the R W. Johnson PhArmaceutical Research Institute. 
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payment of page charges. This article must therefore be hereby 
marked *^advtrtisement** in accordance with 18 U.S.C. Section 1734 
solely to indicate this fact 

' The abbreviations used arc: Epo. erythropoietin; IL. interleukin; 
bp, base pair(s); RIA. radioimmunoassay; IPTG, isopropyl-l-thio-/?- 
O-galactopyranoside; WT, wild type. 



Other members of a family of cytokines which also include 
gTo^^th hormone, prolactin, IL-2. lL-3, [L-4, IL-5, IL-6, IL-T. 
G CSF, GM-CSF, M-CSF, oncosiatin M, leukemia inhibitory 
factor, and ciliary neurotrophic factor (15-17). The genes 
encoding these proteins have similar numbers of exons as well 
as a clear relationship between intron-exon boundaries and 
predicted a-helical structure. These similarities have led to 
the prediction that this family of cytokines share a common 
pattern of folding into a compact globular structure consisting 
of four amphipathic a- helical bundles. Such theoretical 
models of the structures of human growth hormone (18) and 
IL-4 (19) have been in remarkably good agreement with 
subsequent structures established by x-ray diffraction (human 
growth hormone) (20, 21) or by multidimensional NMR (IL- 
4) (22. 23). Moreover, the crystal structures of GM-CSF (24) 
and monomeric M-CSF (25) are also in reasonable agreement 
with their predicted structures. 

Thus far. the structure of Epo has not been analyzed by 
either x-ray diffraction or by NMR. In order to begin to gain 
an understanding of structure -function relationships, we have 
taken a three-pronged approach, 

(a) Sequence determination of Epo from mammals of dif- 
ferent orders in order to establish regions of homology (26). 

(b) Construction of a model of the three-dimensional struc- 
ture of Epo, followed by the design and preparation of muteins 
that test this model. These experiments are presented in this 
paper. 

(c) Design and testing of muteins that provide information 
on receptor binding domain (s). This work will be presented 
in a subsequent paper. 

MATERIALS AND METHODS 

Computer-based Modeling of Structure 

Prediction of Secondary Structure — Epo sequences from human, 
monkey, mouse, rat, sheep, pig, and cat were aligned (26) and exam- 
ined using a hierarchical approach to secondary structure prediction 
that assumes that these proteins are members of the a/a folding class 
(27). First, the pattern -based method of Cohen et ai (28) for turn 
prediction was used to delimit sequence blocks likely to contain 
secondary structure. Predictions using the methods of Gamier et ai 
(29) and Chou and Fasman (30) suggested a-helical regions within 
these blocks. Finally, helical wheel projections were used to examine 
and then limit helix length based on preserving amphipathic character 
as codiGed in the work of Presnell et ai (31). The locations of 
glycosylation sites were also used to suggest helix boundaries. 

Tertiary Structure Pre<ftci£on— Eiarlier investigations have re- 
vealed the general principles of helix -to-helix packing in globular 
proteins (32). Exploring these principles, Oiheo et al (33) developed 
a method for the generation of three-dimensional protein structures 
from the secondary structure assignment.' These methods have been 
applied to myoglobin, tobacco mosaic virus coat protein, growth 

* These algorithms are available from Dr. Cohen upon request. 
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hormone, a- and interferon, IL-2, and IL-4 (33-36). 

The algorithm for tertiary structure generation is divided into four 
computations. The program aapatch identifies clusters of hydrophobic 
residues within the putative helices that could mediate helix-helix 
interactions (32). Aafold generates all possible helix pairings accord- 
ing to the location and geometric preferences of the interaction sites. 
Aabuild generates the three-dimensional models of all possible struc- 
tures from the list of helix pairings (from aafold) and subject to steric 
restrictions and geometric constraints on chain folding. In the final 
step, aaoector applies the user-defmed distance constraints {e.g. di- 
sulfide bridges) to the structures generated. At this stage, coordinates 
have been specified only for residues in the core a- helices. For residues 
in sequentially distinct loops, lower bounds on the inter- residue 
distances can be inferred from the relevant helix terminus. 

Preparation of Epo Muteins 

Construction of the Mutagenic/ Mammalian Expression Plasmid — 
A Ml 3 plasmid, containing a 1.4-kilobase EcoK\ EcoR\ human Rpo 
cDNA insert (XHEPO FL12) was a gift from Genetics Institute 
(Cambridge, MA) (8), A 943-bp EcoRl-BglU fragment, corresponding 
to the complete coding sequence of the wild type human erythropoi- 
etin, including untranslated regions 216 bp upstream and 183 bp 
downstream, was inserted into the mammalian expression plasmid 
pSG5 (Stratagene) (37) and named pSG5-EP0/WT. 

Site-directed Mutagenesis— was carried out according to the pro- 
tocol described by Kunkel et al. (38). Single- stranded DNA was 
rescued from the pSG5-EP0/WTphagemid grown overnight in Esch- 
erichia coli CJ236, in 2XYT media containing M13K07 helper phage 
(In Vitrogen) and 70 ftg/ml kanamacyn (Sigma). The resulting uracil- 
containing single-stranded DNA was used as a template for mutagen- 
esis. Oligonucleotides (24-46- mer) were synthesized with their 5' and 
3' ends complementary to the target wild type Epo sequence. A large 
variety of mutations (base substitutions, deletions and insertions) 
were created at the centers of the mutagenic primer sequences. 
Annealing of the phosphorylated primers (10:1 oligonucleotide/DNA 
template molecular ratio) was performed in 10 pi of a 20 mM Tris- 
HCl. pH 7.4, 2 mM MgClj, 50 mM NaCl solution. The reactions were 
incubated at 80 'C for 5 min and then allowed to cool slowly to room 
temperature over a 1-h period. The DNA polymerization was initiated 
by the addition as a mix of 1 /*! of 10 x synthesis buffer (100 mM 
Tris-HCl, pH 7.4, 50 mM MgCij, 10 mM ATP. 5 mM each dNTPs, 20 
mM dithiothreitol). 0.5 ni (8 units) of T4 DNA ligase and luKl unit) 
of T4 DNA polymerase (Boehringer Mannheim). After 2 h at 37 *€, 
80 M I of 1 X Tris-EDTA was added. 5 n\ of the diluted reaction mix 
was used to transform competent E, coli NM522 (ung*, dut*). 

Since a 40-80% mutation yield is normally obtained, four to five 
double-stranded plasmid clones from each reaction were sequenced 
with 7-deaza-dGTP and Sequenase (U. S. Biocliemicals Inc.) (39). As 
a rule, the entire coding sequences of the Epo mutants were examined 
for the presence of unwanted mutation by sequencing or restriction 
enzyme mapping. 

Production of Wild Type and Epo Muteins in Mammalian Cells- 
Cos! cells grown to -70% confluence were transfected with 10 Mg of 
recombinant plasmid DNA/lO-cm dish using the calcium phosphate 
precipitation protocol (40). As a control of transfection efficiency, in 
several experimenU 2 Mg of pCHllO plasmid (Pharmacia LKB Bio- 
technology Inc.) was cotransfected and ^-galactosidase activity meas- 
ured in the cytoplasmic extracts. 

RNA Blot-hybridization Analysis—Totai RNAs were prepared from 
cultured Cos7 cells (41) and 2'ng samples electrophoresed on a 1.1% 
agarose gel containing 2.2 M formaldehyde. Transfer to GeneScreen 
Plus filters (Du Pont- New England Nuclear) and hybridization with 
a ^'P-labeled WT Epo probe were carried out as previously described 
(42). 

Quantitation of Transiently Expressed Recombinant Epos—The 
amount of secreted protein in the supematants of transfected Cos? 
was determined by a radioimmunoassay (RIA). The RIA was per- 
formed using a high titer rabbit polyclonal antiserum raised against 
the human wild type Epo and produced in our laboratory. '**I-Labeled 
recombinant Epo was obtained from Amersham Corp. Details of the 
protocol have been published elsewhere (43). 

fmmunoprecipitation of ^S-Labeled Epo Proteins—Three days 
after transfection, the Cos? monolayers were washed ejrtcnsively with 
1 X phosphate-buffered saline and the cells incubated for 20 min at 
37 'C in 2 ml of Mef/Cys* minimum essential media £agle*s modified 
medium. In each culture dUh. 100 lA of TRAN**S-LABEL (l*S] 
cysteine and l^Slmethionine, -10 mCi/ml, ICN Biochemical) was 



then added. After 2 h. the conditioned media were harvested, and 
cellular extracts were prepared by lysis in radioimmune precipitation 
buffer (50 mM Tris-HCl, pH 8.0. 150 mM NaCl. 0.02% (w/v) sodium 
azide, 0.1% (w/v) SDS, 0.5% (w/v) sodium deoxycholate. l9c (w/v) 
Triton X-100. \ mM phenylmethylsulfonyl fluoride, and 1 Mg/ml 
aprotinin). Samples were precleared with rabbit preimmune serum/ 
protein A-Sepharose CL-4B (Pharmacia) for 2 h. Immunoprecipita- 
tions were performed overnight with our polyclonal antibody specific 
for human recombinant wild type Epo and immunoadsorbed with 
protein A-Sepharose CL-4B. Immunoprecipitates were run on 15% 
SDS-polyacrylamide gels (44) and analyzed by autoradiography after 
treatment with Enhance (Du Pont-New England Nuclear). 

Bioassays — The dose-dependent proliferation activities of WT and 
Epo muteins were assayed in vitro using three different target cells: 
murine spleen cells, following a modification of the method of Krysta] 
(45, 46); murine Epo-responsive MEL cell line, developed by Hankins 

(47) ; and human Epo-dependent UT-7/Epo cell line, derived from 
the bone marrow of a patient with acute megakaryoblastic leukemia 

(48) . After 22-72 h of incubation with increasing amounts of recom- 
binant, proteins, cellular growth was deter n.ir.ed by l^HiLhy.T.idine 
(Du Pont-New England Nuclear) uptake or by the colorimetric MTT 
assay (Sigma) (49). 

Bacterial Expression — The wild type Epo target, corresponding to 
the nucleotide sequence coding for the mature protein, was polymer- 
ase chain reaction -amplified using appropriate primers. In the sense 
primer an Ndel site (CATATG) was placed immediately 5' to the 
Ala' codon of the mature protein. In the antisense primer a B^fll site 
was placed 3' to the TGA stop codon. After enzymatic digestion, the 
5l6-bp polymerase chain reaction fragment was inserted in an Ndel/ 
Ba/nHI-cut pETl6b plasmid (Novagen), which has a T7 promoter 
followed immediately by the lac operator. IPTG induction of trans- 
formed E. coli BL21(DE3)(T7 RNA polymerase*. Ion", ompT") re- 
sulted in high levels of expression of a fusion protein with a 10- 
histidine stretch at the amino terminus. The oligo-His tag allowed 
the binding of the produced (Hisio)-Epo on a nickel affinity resin and 
its elution by increasing imidazole concentrations in presence of 
phenylmethylsulfonyl fluoride (Sigma). Most of the produced protein 
formed insoluble aggregates and was solubilized and affinity-purified 
under denaturing conditions in 6 M guanidine HCl. Oxidative refold- 
ing was performed by overnight dialysis at 4 'C against 50 mM Tris- 
HCl. pH 8.0, 40 nM CuS04» and 2% (weight/volume) Sarkosyl, Soluble 
protein was further dialyzed against 20 mM Tris-HCl, pH 8.0, 100 
mM NaCl, 2 mM CaClj. and subjected to factor Xa (New England 
Biolabs) cleavage to remove the NHj-polyHis sequence. Monitoring 
of the fusion protein following induction and during the various steps 
of purification was done by electrophoresis on a 15% polyacr>lamide- 
SDS gel, stained with Coomassie Brilliant Blue. Alternatively, the 
His -Epo fusion protein was detected on Western blot (50), using a 1/ 
2000 dilution of our WT native Epo polyclonal antibody and a second 
biotinylated rabbit-specific antibody which is detected with a strep- 
tavidin-alkaline phosphatase conjugate (Amersham). 

In Vitro Transcription/TranslationSense and antisense primers, 
creating new BglU sites, respectively. 5' and 3' of the initiator and 
stop codons, were used in a polymerase chain reaction on pSG5- 
EPO/WT template. After Bgai cleavage, the 594-bp polymerase chain 
reaction fragment was subcloncd into pSP64T (51). This SP6-con- 
taining vector provides 5'- and 3 '-flanking regions from Xenopus 
globin mRNA, which allow efficient in w'fro transcription/translation. 
Previous experiments showed poor yields of in vitro translated pro- 
tein, when using the GC-rich natural 5' Epo untranslated region. 
One step in vitro transcription/translation was carried out by incu- 
bation of 1 ;ig of circular p64T-Epo in a 50-mI reaction volume of 
SP6-TnT-coupled rabbit reticulocyte lysate system (Promega), in the 
presence of l^Sjcysteine (1200 Ci/mM. Du Pont-New England Nu- 
clear). In some cases, canine pancreatic microsomal membranes were 
added to the reaction mix. A purified GST-human Epo receptor 
extracellular domain fusion protein (EREx) was a gift from W. Harris 
and J. Winkclman, and the binding of the **S-labeled translation 
producU onto EREx-glutathione agarose beads was performed as 
described (52). 

RESULTS 

Construction of a Model of the Three'dimensioncd Structure 
of Erythropoietin 
From an analysis of the putative Epo helix sequences, 
aapatch identified eight possible helix-helix interaction sites. 
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Table I 

Predicted a helical regions of the mature erythropoietin protein 
Data were obtained using the various algorithms for secondary and 
tertiar>' structure generations described under **MateriaIs and 

Methods." 

— Potential helix-helix 

Helix NH. terminus COOH terminus i^^^.^^^ion aites 

A 9 22 19 

B 59 76 63,67,70,71 

C 90 107 95. 102 

D 132 152 141 



In principle, these sites could be used to generate 1.6 x 10^ 
structures. Of these, only 706 maintained the connectivity of 
the chain and were sterically sensible. These structures resem- 
bled four helix bundles, an increasingly common motif in 
protein structure (53). The structures that were not compat- 
ible with the native disulfide bridge between Cys? and Cysisi 
were eliminated. This reduced the total number of structures 
from 706 to 184 (total computer time approximately 1 h on a 
Silicon Graphics IRIS 4D/35G). The remaining structures 
were then rank ordered by solvent -accessible surface contact 
area, a measure of the validity of model structures. The most 
compact structures were right handed, all an ti -parallel four- 
helix bundles with no overhand connections, but this may be 
an artifact of a failure to add the polypeptide chain that forms 
the loops to the helical core constructed by aobuild. The other 
less compact structures were left-handed four-helix bundles 
with two overhand loops, a topology previously seen in the 
structures of IL-4 and growth hormone. We suspect that this 
is the likely structure for Epo. The consensus for assignments 
of putative a-helices in human Epo are summarized in Table 
I. First, analysis of the topological distribution of known four- 
helix bundle structures indicates that nearly all examples 
have an antiparallel orientetion (53). Second, the left-handed 
four- helix bundles with two overhand connections arrange 
the four amphipathic helices to form a compact hydrophobic 
core. Finally, the AB and CD loop regions of Epo are predicted 
to have ;?-sheet segments analogous to IL-4 and growth hor- 
mone that preserves the compact globular nature of the Epo 
model structure. Fig. 1 shows schematic represenutions of 
predicted topological interactions between the four anti -par- 
allel a-helical bundles. 

Several authors have suggested that the helical cytokines 
form a structural superfamily (34. 54-57). On the basis of 
both the mature protein and the individual a-helices, Epo 
seems to be more closely related to growth hormone, prolactin, 
IL-6, and GM-CSF rather than the other members of the 
helical cytokine superfamily. Nevertheless, recent improve- 
ments in algorithms for the identification of distant evolu- 
tionary relationships between proteins from structural finger- 
prints suggested that it might be possible to align the IL-4 
structure to the Epo sequences. The Eisenberg et al (5S) 
structural environment and 3D-1D profile methods are a 
powerful tool for recognizing that a sequence is compatible 
with a known structure, e.g, a foufhelix bundle. The NMR 
structure of IL-4 from Smith et al (59) was used to construct 
a 3D-1D profile. A mixture of sequences including four helix 
bundles, globins. and non-helical structures were aligned 
against the IL-4 profile. Not surprisingly, the IL-4 structures 
from human and mouse gave the highest scores {2? = 22.8 



' Z scores are used to describe the normalized weight associated 
with a profile score. A distribution is butlt from a collection of 
sequences with a mean Z score of 0.0 and a standard deviation of 1.0. 
Z scores greater than 6.0 are associated with significant alignments. 
Z scores between 3.0 and 6.0 may or may not be structurally relevant. 



and 8-1). However, the other known four- helix bundle cyto- 
kines known to share a similar fold with IL-4, e.g. human 
growth hormone (60) (Z = 2.3) and GM-CSF (61) (Z = 2.3) 
faired no better than some globin sequences (Kuroda's and 
slug sea hare globin. Z = 5.0 and 4.8) that adopt a distinct 
tertiary structure. The results for the human and sheep Epo 
sequences were also ambiguous (Z = 1.6 and 0.8). These 
results suggest that while profile methods are a powerful tool 
for recognizing structural similarity, their failure to identify 
homology does not exclude the possibility that two proteins 
share a common fold. For distantly related or unrelated struc- 
tures, current profile methods cannot replace de novo methods 
for tertiary structure prediction. 

Design and Expression of Epo Muteins That Test the 

Proposed Structure 

To test the proposed four a-helicai bundle structure of 
erythropoietin and at the same time to attempt to locate 
functional domains, we created by site-directed mutagenesis 
a series of deletion, insertion, and replacement mutants. 
These muteins were designed to analyze the principal pre- 
dicted structural features of the molecule: a-helices, intercon- 
necting loops, as well as the NH2 and COOH termini. Struc- 
tural and functional implications of the disulfide bridges and 
the giycosylation sites were also investigated. 

a-He/ices— Short amino acid deletions were prepared in, or 
close to, the predicted A, B, C. and D a-helices. Human wild 
type and muteins were transiently expressed in Cos7 cells. 
Northern blot analyses demonstrated that all the mutant 
plasm ids produced about the same amount of mRNA as that 
of the wild type (data not shown). Yet, no detectable amount 
of Epo protein could be found in the Cos7 supernatants, either 
by radioimmunoassay or by bioassay using various Epo-de- 
pendent cell lines. Table II summarizes these findings. 

An example of SDS-poIyacrylamide gel electrophoresis of 
immunoprecipitants from in vivo ^S labeling is presented in 
Fig, 2. As expected, when Cos? cells were transfected with 
pSG5-EP0/WT, a 35-37 kDa band was detected in the su- 
pernatant. In contrast, the deletion mutants (Table II) could 
be detected in cellular extracts but were not exported from 
the cells. Fig. 2 shows the cytoplasmic retention of the mutein 
A140-144, lacking 4 residues in the middle of the predicted 
D -helix. The apparent molecular mass (~ 28 kDa) is less than 
expected for a 5 -amino acid deletion. Therefore, not only the 
secretion, but also the giycosylation, seem to be impaired. 
None of these muteins had deletion of giycosylation sites. It 
is likely that full giycosylation of Epo requires conservation 
of its molecular architecture. Similar results (reported in 
Table II) were obtained for all the muteins having partial 
deletion of an a-helical peptide segment. 

Because contaminants in crude Cos7 cellular extracts se- 
verely interfere with the radioimmunoassay, no direct Epo 
quantitation was possible. However, aliquots of hypotonic 
extracts of Cos? transfected with wild type Epo were able to 
sustain HDC57 proliferation. No similar biological activity 
was found for muteins with limited deletion of a-helices. 

Interconnecting Loops — The peptide segment joining A- 
and B-helices presents several interesting features (Fig. 2A). 
AB loop consists of 36 amino acids. Two N-glycosylation sites 
and a small disulfide bridge are located in the first half and 
their biological implications will be discussed later. The 
COOH end of the AB loop contains a stretch of amino acids 
that is strongly conserved among mammals (26). Alignments 
of human, monkeys, cat, mouse, rat, pig, and sheep Epos 
showed a consensus sequence: DTKVNFYAWKR(M/I)(E/ 
D)VG (residues 43-5?). Three deletions were constructed: 
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FiC. 1. Model of the three-dimensional structure of erythropoietin. A, ribbon diagram of the predicted Epo tertiary- structure. I he 
lour c.-helices are labeled A-D {magenta): Loops between helices are named for the helices they interconnect. Two regions of extended 
structure which could fi)rm hydrogen bonds between Loop AB and Loop CD are also presented Uyan). N- and 0-glycosylation sites are 
indicated in A'mvi and blue, respectively. Disulfide bonds bridge residues 29-33 in Loop AB, and 7-161 on the NHrterminal side of Heltx A 
and the COOH-terminal side of Heiix D are not shown. S.B.: The loop tracing shown does not represent predicted coordinates. B, schematic 
representation of Epo's primary structure depicting predicted up-up-down-down orientation of the four antiparallel a-helices (boxes with 
arrowhead). This folding pattern is strongly suggested by the large size of the two interconnecting loops AB and CD. The limits of each helix 
were drawn accordingly to Table I. A predicted short region of 0-sheet is delineated by the dashed rectangle. The N-glycosylauon sites are 
represented by the dotted diamonds, and the 0-glycosylation site by the dashed oval. The locations of the two disultlde bridges are shown. C. 
cross-section of the Epo molecule at the level of the four rt-helices. The helical wheel projections arc viewed from the NH, end of each helix. 
The hydrophobic residues, localized inside the globular structure, are indicated by filled circles. The charged and neutral residues (open and 
/^ravcirc/e.s» respectively) are exposed at the surface of the molecule. 



A43-47. A48~52, and A53-57. and transiently expressed in 
Cos7. The amount of muteins detected by RIA in the super- 
natants of transfected cells was 10-40% lower than observed 
with wild type Epo (Fig. 3B). Nevertheless, the three secreted 
muteins were biologically active. However, because A48-52 
exhibited a marked decrease of the specific bioactivity, this 
site was studied in more detail by means of serine replace- 



ments. Krystal ex vivo bioassay as well as HCD57 and UT7- 
Epo in vitro bioassays showed that these Ser mutants had 
biological activities similar to that of wild type (Fig. 3C). 
Therefore, the observed decreases in both RIA and bioassay 
for the three deletion mutants are likely to be the result of 
changes of structural conformation. The long length of loop 
AB may be critical for the up-up-down-down topography. A 
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Tablk 11 

Shart (it'Unians in, ar < /«<sv tn, d -/ir'/a'cs- 
Predicted NH. and COOH lennini 'preach helix are indicated in 
I lie verlical iKtxes. 

Mutants 

S\2 \(') 9 A helix 

1 

Af)5-*)9 '>9 B helix KNA levels comparable lo VVT. 
1 

70 

AiMi-lOD yo C helix No detectable Epo in the Cos? 

supernatant, both by RIA 
and bioassay. 

A 105- 109 1 
1(»7 

^122- lit) 

l.i.'f i:i2 1j helix 

^i-t2 )50 
Al52-lor. 152 

Cellular 

Extracts Supernatants 



1 2 3 4 5 6 7 




21.5- 



Fig. 2. Immunoprecipitalions of wild type Epo and the 
A 140-1 44 mutein. Cos? cells were transfecied with pSG5. pSG5- 
EPO/wt. or pSCJ5-EPO/A140-l44. After 3 days, the cells were met- 
alx)lica!ly laljeled with (^S| methionine and I cysteine, immuno- 
precipiutions of cellular extracts and supernatants were performed 
with our polyclonal antibody, raised in rabbit against the native 
human Ep<j. The immunoprecipiutes were analyzed by SDS-poly- 
acrytattiide gel electrophoresis. Lanes 2-4 correspond to the cellular 
extracts; lanea 5-7 in the culture supernatants from transformed Cos7 
with: plasmid without insert {lanes 2 and 5). wild type Epo {Utnes 3 
and 6), and A140-144 (lanes 4 and 7». Lane I represents the protein 
molecular weight standard. The two arrows show the normal secretion 
of the wild type Epo (35-37 kDa) and the c>loplasmic retention of 
the mutein ^140-144 (-28 kDa). 

shorter AB segment may impose a strain on the interhelical 
connection. Chou and Fasman (30) algorithms predicted a 
short ^-sheet structure from residues 44 to 51 (<Pa>;<Pb>, 
1.005 < 1.196). The presence of a short region of /?-sheei in 
the connection between helices 1 and 2 (A and B) have been 
documented in the analyses of the three-dimensional struc- 
tures of IL-4 (22, 23), GM-CSF (24), and monomeric M-CSF 
(25). In contrast, in human GH a short segment of a-helix is 



found at the same location (20). The siruclure/funclion im- 
plications of these short features are not yet understood. 

Helix B is linked to helix C by a much shorter segment 
(residues 77-89) and contains in its center the third N- 
glycosylation site {Asn'**). When the A78-82 mutein was ex- 
pressed, a secreted protein was detected in the conditioned 
medium and conferred proliferative bioactivity on Epo -de- 
pendent cell lines (see Fig. 8). 

A similar long crossover connection (2;^ amino acids) is 
found between helix C and helix D. In contrast to what we 
previously observed for loop AB, a large deletion of 9 residues 
at position 111-119 or a 7-amino-acid insertion of a myc 
epitope after residue 116 did not affect the secretion of these 
muteins (Fig. 4). Furthermore, these two proteins had normal 
specific activity, as seen by the ratio of bioassay to RIA. Our 
ralDbit polyclonal antibody raised a<;ainsi the native form of 
ilie iiunian wild i\pu fully recognized the iwu mutants, dem- 
onstrating that the overall spatial conformation of Epo was 
well preserved. According to the algorithm of Emini et ai 
(62). the residues 111-119 are predicted lo be at the surface 
of the molecule. Since the iil ll-l 19 mutein is readily secreted 
and has full biological activity, it seems unlikely that the 
putative /^-sheet segment in the CD loop is an important 
determinant of molecular stability. Primary amino acid align- 
ments of mammalian Epo showed a large variation in the 
sequence of residues 116-130, including amino acid deletion, 
insertion, and substitution (26). Surprisingly, when the dele- 
tion ill 22- 126 mutein. which removed the 0-glycosylation 
site (Ser*"***), was transiently expressed in monkey cells, pro- 
tein secretion was inhibited. Both rodents, rat and mouse, 
lack the 0-glycosylation site because of a Ser''*' to Pro replace- 
ment. Furthermore, when a Ser*-^ replacement mutein was 
expressed in normal Chinese hamster ovary cells, (63) or 
when wild tvpe Epo was expressed in cells having a defect in 
0-linked glycosylation (64), neither secretion nor biological 
activity were impaired. Therefore, failure of secretion of the 
^122-126 mutein may be the result of some other structural 
alteration. In particular, the proline residue at position 122 is 
invariant among mammals. 

NHi and COOH Termini — Deletion of residues 2 to 5 only 
slightly affected the processing of a biologically active protein 
(see Fig. 8). This deletion may impair cleavage of the propep- 
tide, therefore explaining the lower yield of secreted Epo 
mutein in comparison to that of wild type. The fact that the 
mature monkey protein has an elongated (Val-Pro-Gly) NH^ 
terminus strongly suggests that the NH^-lerminal part is not 
involved in the bioactivity of the molecule. Further evidence 
comes from the results, reported below, on the N-poly-His- 
Epo fusion protein expressed in E, coli^ and also from the 
identical binding of in uitro translated •*'*'S-Iabeled wild type 
Epo onto EREX-glutathione agarose beads (Fig. 5), with or 
without addition of canine pancreatic microsomal membranes 
which permit cleavage of the propeptide.* 

The COOH-terminal sequence following helix D can clearly 
be divided into two distinct domains, separated by Cys****. The 
residues 151-161 were of special interest because they are 
highly conserved among mammals (26). There are only two 
substitutions: Lys*** is replaced by a Thr in artiodactyls and 
cat. and Ala'^ is replaced by a Val in mouse Epo, Both the 
;M52-155 and the ^156-160 muteins remained in the cytosol 
of the transfected Cos? (Table II). One possible explanation 
is that the residues 152-160 may, in fact, participate in the D 
helix. We predict that Gly'*' is the break point of the struc- 



*AU the mutants described in this paper were subcloned into 
pSPG4T plasmid. Studies of the binding of their translation producU 
to EREx are in process. 
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Fig. 3. Interconnecting ioop AB. A, schemaUc repreaenteUon of the loop AB showing the localization of muteins with vanous delcUons 
and amino acid replacaements. The dashed arrows point to the positions of the serine subsUtuUons (in A4S-62). The two Njglywwylation 
sites are repiesented by the gmy diamonds. The small Cy8**«Cy8*' disulfide bridge is indicated. B, amount and biological activities of secreted 
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ture. However» it is possible that this residue causes only a 
bend in the a-helical structure and helix D may extend to 
Cy". The COOH-terminal part of the protein (residues 162- 
166) is clearly not involved in any structural or functional 
feature. Thus, the deletion of the 4 last amino acids or the 
replacement of residues 162-166 by either a KDEL sequence 
or a poly-histidine sequence* did not modify the specific 



• The poly-Hie tail wild type mutant was purified by means of 
nickel affinity chromatography which enables quantitation of cyto- 
solic-retoined mutants. The (His)e COOH-terminal sequence has 
been appended to all the muteins described in this paper. Experiments 
are in progress to exploit this strategy. 



activity of the erythropoietin (Fig. 6). Radioimmunoassay 
revealed that the secretion of the KDEL-tail rautein in the 
media of transfected cells was 45% less than normally ob- 
tained with the wild type Epo. However, when compared to 
the wild type, this mutein had more biological activity in the 
hypotonic Cos? cell extracts. The. KDEL COOH-terminal 
sequence has been shown to be essential for the retention of 
several proteins in the lumen of the endoplasmic reticulum 
(65). Nevertheless, because of overproduction in transiently 
expressed cells, a large percentage of recombinant protein 
escaped into the media. 

Disulfide Bridges—V/ang et aL (66) demonstrated that the 
biological activity of Epo was lost irreversibly if the sulfhydryl 



muteins. The upper bar graphs show the illative secretion of wild type and loop AB muteins as determined by radioimmunoassay. The lowest 
bar graphs display the calculated specific activity (ratio bioassay/RlA) for each mutein, in compariaon with the value obtained for the wild 
type Epo (ratio = 100%). C. HCD67 cell proliferation as a function of increasing concentration of wild type and serine-substituted Epo 
muteins. HCD57 cells (10*/ml) were cultured for 3 days in a 96-well microtiter plate with media containing increasing concentrations of 
•ecmed proteins. The line graphs show the cellular growth as measured by (*Hlthymidine uptake. The number of viable cells was also 
measured with the MTT colorimetric assay and gave similar curves. In vitro proliferation experiments using the human UT-7 cell line (48) 
and the Krystal assay (45) produced identical results, cpm, counts/minute; mU, milliunit 
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agarose beads. 6 x 10^ count/min of purified ^S-labeled erythropoi- 
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Harris e! ai Identical binding demonstrated that the conserva- 
lion uf iht pru[K'ptide did nut impair the hormone- receptor interac- 
tion. 
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Kic. 6. COOH end of Epo. A. schematic represenution of the 
analyzed muteins. corresponding to the deletion of the four last amino 
acids A 163- 166 and the replacements of the residues 162-166 by a 
KDEL or poly (His) sequences. relative secretion of these muteins. 
The bioacliviiies in the supernatants (/) and the cell extracts (2) of 
transformed Cos7 cells were measured by in vitro proliferation assay 
using HCD57, More KDEL mutant remained in the cytosol of the 
Cos7, when compared with the wild type Epo and A163-166 or poly 
(His) muteins. However, all the analyred muteins had the same 
specific activity as that of the wild type. mU, milliunit. 

groups were alkylated The mature human Epo has two inter- 
nal disulfide bonds: Cys'-Cys'*', linking the NHa and COOH 
termini of the protein and a small bridge between Cys** and 
Cys**. Cys^"* was previously changed to Pro by site-directed 



inutaKenesis, and the resulting protein wa.s reported to have 
jjreally reduced in vitro bioloKicai activity (liT). However, rat 
and niovj.se Epos have the same substitution and yei exhibit 
full cross-species bioaciivity. To resolve the role of the small 
disulfide bridge in human Epo function, we created a C29Y/ 
C33Y double mutation. The resulting muiein was normally 
processed and showed the same in vitro bioactivity as the wild 
type (Fig. 3B).^ Furthermore, the deletion of 5 amino acid 
residues (A32-36) did not impair the secretion of a biologically 
active mutein. These data suggest that only the native and 
fully conserved disulfide bridge Cys'-Cys'*^* is crucial for the 
presentation of the molecular structure of erythropoietin. 

Functional Role of the G/ycoAv/an'on— Natural or recombi- 
nant human Epo is a heavily glycosylated protein; 40% of its 
molecular weight is sugars (11). The protein has three ;V- 
linked oligosaccharide chains, located at amino acid positions 
24 and 38 (in predicted loop AB) and position 83 (in loop 
BC). It has one 0-Iinked carbohydrate chain at position 126 
(in loop CD), which is missing in rodents. The role of these 
sugar chains in the biological activity of the human hormone 
has been extensively studied. Site-directed mutagenesis ai the 
iV-glycosylation sites demonstrated that even though the sug- 
ars were important for proper biosynthesis and secretion, 
their removal did not affect in uitro activity. This finding was 
corroborated by several investigators (fj8. (i9K However, Tak- 
euchi et ai (70) found thai iV-glycanase digestion result.s in 
almost complete loss of biological activity. In contrast, there 
is general consensus that glycosylaiion plays a key role in the 
biological activity of the hormone in vivo. Various reports 
have demonstrated that the .V-linked sugar chains enhance 
the stability and survival of Epo in the blood stream (71, 72) 
and protect the hormone against clearance by the liver (73), 
thereby enabling the transit of the hormone from its site of 
production in the kidney to its target cells in the bone marrow 
(74). 

We expressed the wild type Epo in E. coli. Accordingly, the 
produced protein completely lacks sugar. The pET expression 
system was used and is detailed under "Materials and Meth- 
ods." IPTG induction of transformed BL21(DE3) bacterial 
strain rapidly results in a high level of expression of the poly- 
His Epo fusion protein (Fig. 7, A and B). Aher 3 h of 
induction, we obtained a typical yield of -1 mg of total 
protein/ml of culture. However, the vast majority of the 
expressed protein was present in the inclusion bodies, and 
therefore its solubilization and purification on the nickel 
beads were performed in 6 M guanidine HCK Oxidative refold- 
ing and factor Xa cleavage resulted in soluble forms (Fig. 7C), 
and the in vitro biological activity was tested on HDC57 cells. 
The cleaved E. coli recombinant Epo showed a notable de- 
crease of the specific activity (10% less than the fully glyco- 
sylated mammalian expressed protein), but was still able to 
maintain HCD57 proliferation. The obser\'ed reduction of in 
uitro activity is likely to be due to improper refolding of the 
insoluble protein and impaired physical stability of the E. coli 
Epo as previously reported (55). However, the fact that the 
E, coli Epo was still able to trigger HCD57 growth indicated 
an overall preservation of the molecular structure. The un- 
cleaved fusion protein, with 10 His residues at the NHj 
terminus, exhibited a 67% loss in biological activity when 
compared to the cleaved protein. Thus, the addition of a 20- 
residue sequence to the NHi terminus partially inhibited the 
biological activity. 

•Two single replacement muteins (C29Y and C33Y) were also 
Ktable and had full biological activity (results not shown). 
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Kic. 7. Bacterial expression of wild type Kpo. A. diiijiram of the tusion |)r(>tein. An NHv-ieniiinat 'i'i-.iiniiio-acici loii^' peptide, 
coiuaininj: a H» histidine slreich. was lused lo ihe inaiurt' orylhropoietiti se<juenco. Kitcior Xa cleayaj;^ allowed the rcvvtry of iho mature 
F,p«i with omIv J extra residues at itP amino ternnnus. H. ITTC induction ul the iusion protein. Transformed E. (*>{i MLJUUK-ll cultures 
tOl)..,. = (Mii'wert' jjrown in tlie presencf ol 1 mM II'TC Aiiquois wt-re rolleeted at 0 \ latw 2h 1 ikmr 2 ijanr ; i. and ;{ h ihtnr and 
jinaly/ed oji ;i l.'i'V SDS-polyacr>-iamido geL stained with Cnomassie Brilliant Hhie. A hi^di k-vel ol" protiui-tion (•!* the fusion protein w;i> 
rapidA ulnainHl. Ijirif 6' corri^spttr.ds to an aliquot iVom iranstVirnied hacteria j;r»'wn tur :i li in a medium without II' TC;. iMnr ; is a low 
molfi ular weight standard. (\ purification oCihe fusion protein. After '\ h ol* II* Tt; indiu tiun. tlie produced i His);. . Kpo was .^oluljili/ed in t» 
M i.ni;midine H(.'L and jjunfitd on ;t nickel affinity resin l)y increasinfj imidazole nmceni rations follovsini; the pKT His sy.stem protocui 
iNt)vaf:ent. Samples of the cohmui ehiants were analyzed by Sl)S-[>olyacrylHinide ^el electrophoresis and stainwl with C<Htmiissie Brilliant 
Blue, l.cfu- elution bv 20 mM imida/.ole; /one 2. eUition by HMJ mM imidazole, releasing: the fusif.n protein: Iti/w -i, chelation of the niokel by 
a l(H» niM KirrA wash: lane ^. molecular wvi^'ht standard. /). detection of the K. vfli recombinant Kpo on a Western blot, Solubilized proteins 
were separated on a l.')^'; SDS pnlyacrylamide «el. transferred to a nit rmelluhtse membrane, and pro?K»<J wiih a I/2(H'K) dilution of our native 
wild i>i)e p...tyclnnal antibody, as described under "M;iterials Hnd Meih<Kls." Lanr !. analysis alter v>xidailvt- reduciiun: Uirw J, after dialysis 
a;:ainst the factor Xa buffer; and iatw J. after factor Xa oleava^^e. 



DISCISSION 

Currently, the accrual rate of new protein sequences 
thruujih jrcne cloninjr far outstrips the rate of determinaiion 
itf ihree-diniensional structure. Kpo is among a larjie number 
of hiolojiically itnportani proteins which have not yet been 
analyzed by x-ray diffraction or NMH. The problem is sinv 
[)liried by run lu la live evidence that the si run u res of most 
proteins are likely to be variations on existing themes (27). 
indeed, as mentioned ai)ove, Kpo appears to share common 
structural features with a hufje uroup of cytokines (15-17). 

Computer-based prediction of structure can be reduced to 
a ihree-stape process: secondary structure is predicted from 
the primary amino acid sequence and, w^hen available, optical 
measurements. Analysis of Epo by circular dichroism reveals 
about bO% a-helix and no detectable pZ-sheet (7, 11). With the 
knowledge of disulfide bonds, secondary structural elements 
are then packed into a set of alternative tertiary structures. 
The number of plausible arrangements can l)e reduced by 
empirical knowledge of preferred helix-helix packing {jeome- 
tries and the need for globular structure to form a hydrophobic 
core. The putative tertiary structure is then refined by stand- 
ard force field calculations. Since there are a large number of 
alternate tertiary structures, the availability of experimentally 
determined structure of a homologous protein is critically 
important. Thus, the predicted model of Epo structure gains 
considerable validity by knowledge of the structures of growth 
hormone (20, 21) and IL-4 (22. 23). 

We have tested the predicted four anti-parallel u-helical 
bundle structure by means of site-directed mutagenesis. Dele- 
tions within predicted a-helices would be expected lo desta- 
bilize tertiary structure, whereas deletions or insertions in 
non-helical segments should be permitted unless they impose 



undue strain on the structure. For example, a deletion in an 
overhand inior-helical loop may result in insufficient length 
to connect the two helices. Results that we have obtained on 
muteins produced in mammalian (CosTi cells are summarized 
in Fig. 8. Our measurements of the quantities of processed 
mutein by HI. A nuiy underesiimnie the true amount of se- 
creted P'.pi). Kven a small deletion or insert ion can result in a 
conformational chanfiC that may lead m impaired binding: by 
our polyclonal antibody, raised against native human Epo. 
Thus the values for specific activity (biologic activity/RlA) 
that we report must be regarded as approximations. This 
caveat notwithstanding, our mutagenesis results are in good 
agreement with our proposed four <t -helical model of eryth- 
ropoietin. The proper folding of Epo into its native tertiary 
structure is necessary* for stability and biological function. 
Muleins with short deletions inside predicted a-helices were 
not processed and exhibited no biological activity. In contrast, 
when deletions were created in predicted interconnecting 
loops, secreted proteins were detected, to varying degrees, 
both by radioimmunoas.say and l)ioassay. Furthermore, addi- 
tions or deletions at the NH.. or COOH termini did not 
markedly impair the secretion and the biological activity of 
the Epo protein. Moreover, mutations at Cys'^' and Cys*^* 
showed that the small disulfide loop is not critical for biolog- 
ical activity. In order to delineate Epo's functionally impor- 
tant residues involved in the direct binding onto the Epo 
receptor, we have prepared and tested a series of amino acid 
replacements on the surfaces of the predicted a-helices. These 
experiments will be described in a subsequent paper. 

Acknowted^ments—We thank Elizabeth Eldridge for technical as- 
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Fig. 8- Relationship between production of muteins and proposed secondary structure. This bar graph shows the amount of 
secreted proteins in the supernatants of transiently expressed Epo mutants, as detected by radioimmunoassay. The muteins were aligned 
over a schematic representation of the native Epo molecule. Each deletion is shown as a stippled bar, the width of which is proportional to 
the number of residues deleted. The four a-helices are represented by the black rectangles. The two disulfide bridges are indicated. These 
mutagenesis results are in good agreement with our proposed four « -helical model of Epo. 
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Fine-Structure Epitope Mapping of Antierythropoietin Monoclonal 
Antibodies Reveals a Model of Recombinant Human 
Erythropoietin Structure 

By Steve Elliott, Tony Lorenzini, David Chang, Jack Barzilay, Evelyne Delorme, James Giffin, and Lyndal Hesterberg 



We have isolated and mapped the rHuEPO epitopes for three 
noncompeting anti-EPO monoclonal antibodies (MoAbs). 
The MoAb 9G8A recognizes a linear epitope that includes 
amino acids 13, 16, and 17. MoAb F12 recognizes a confor- 
mational epitope that includes amino acids 31 through 33, 86 
through 91, and 138. MoAb D11 recognizes a conformational 
epitope that includes amino acids 64 through 78 and 99 
through 110. MoAb Oil neutralizes rHuEPO activity which 

THE ERYTHROPOIETIN (EPO) gene has been cloned 
and biologically active material (rHuEPO) has been 
produced in mammalian cells.'"* The gene encodes a 193 
amino acid protein." ' The secreted protein is 165 amino acids 
with approximately 40% carbohydrate.^^ Several models of 
EPO structure have been described*"' but there is no available 
nuclear magnetic resonance (NMR) or crystal structure. One 
model' is supported by analysis of the effect of deletions on 
secretion of EPO. In this case it is assumed that deletion of 
helices, but not connecting loops, will result in a decreased 
rate of secretion because of a loss of EPO structure. Unfortu- 
nately there is little additional information to test existing 
models or to develop new ones. Therefore we decided to 
develop an immunological approach to determine rHuEPO 
structure. 

Immunological studies can be used to determine structural 
and functional information about proteins. For example, 
mapping of epitopes recognized by neutralizing antibodies 
reveals information about the active site. Conformational 
epitopes require proper folding to juxtapose the amino acids 
involved in antibody binding. Therefore, epitope mapping 
by antibodies that recognize conformational epitopes reveals 
structural information about the antigen because amino acid 
residues close together in space are identified. When epitope 
mapping by several different antibodies is done it is possible 
to determine the folding pattern of the molecule. Additional 
consu-aints on structure can be obtained if the antibodies 
recognize nonoverlapping epitopes. The epitopes must there- 
fore be separated from each other in the structural model. 
This type of approach has been used successfully to develop 
a model of human growth hormone structure.'" 

There are many different approaches to mapping of epi- 
topes. Antibody binding to synthetic peptides or peptide frag- 
ments from the antigen can localize the binding site. This 
approach is usually successful only for antibodies that recog- 
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suggests that its epitope may contain the receptor binding 
domain. Analysis of the effect of mutations on folding al- 
lowed the identification of buried residues, a-helical, and 
non a-helical regions. This data along with epitope mapping 
data of anti rHuEPO monoclonals was used to model rHuEPO 
protein structure. A model consistent with the data is a 4- 
helix bundle with short and long interconnecting loops. 
<D 1936 by The Americat} Society of Hematology. 

nizc linear epitopes. Another approach is to deicrminc anti- 
body binding to naturally occurring vananis, such as proteins 
from different species. However, such variants may not be 
available or the variants may have too many amino acid 
differences to unambiguously identify those important for 
antibody binding. In addition, this approach may also fail if 
the antibody recognizes a particular conformation and the 
variants have different conformations. A third approach is 
to construct variants with single amino acid substitutions by 
recombinant DNA methodologies and then determine which 
mutations result in reduced antibody binding. This approach 
can result in false positives if the mutation alters folding in 
a way that affects antibody binding. However, it has the 
advantage that mutations in residues that are part of the 
epitope result in reduced binding even for antibodies that 
recognize conformational epitopes. Thus, if one can distin- 
guish between specific and nonspecific effects on antibody 
binding, the epitope can be determined. 

Our approach to mapping antibody epitopes was to con- 
struct rHuEPO variants and measure antibody binding to 
them. We used two methods to overcome the possibility that 
mutations would result in false positives because of changes 
in rHuEPO conformation that indirectly affected immu no re- 
activity. In many cases multiple substitutions were made ai 
each position to increase the probability that a mutation could 
be identified that had a minimal effect on protein folding. 
Secondly, we also attempted to identify those mutations that 
resulted in altered protein folding. This was done by a.ssaying 
the rHuEPO variants with conformation sensitive antibodies 
including the anti-EPO monoclonal antibody (MoAb), 
9G8A. 9G8A immunoreactivity increa.ses when rHuEPO is 
denatured." Thus, mutations that reduce MoAb immunore- 
activity because ihey cause gross conformational changes 
were identified. Another benefit of this approach is that pre- 
dictions of secondary structure can -be made according to 
the effect of mutations on folding. For example, it has been 
reported that mutations in connecting loops have little el feet 
on structure while mutations in a-hclical or /?-sheet regions 
alter structure.'* ** Wc have developed a model of rHuEPO 
structure by using the epitope mapping data in combination 
with predictions according to effects of amino acid substitu- 
tions on folding. 

MATERIALS AND METHODS 

Materials. EPO peptides were synthesized on an Applied Bio- 
systems (Foster City. CA) model 430A peptide synthesizer. The 
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isolation and characterizaiion of the MoAbs F12, DIl, and 9G8A 
has been described." The EPO standard used in immunoassays is 
recombinant human EPO produced in Chinese Hamster Ovary cells. 
'"I EPO was obtained from Amcrsham (Arlington Heights, IL). 

Radioimmunoassay (RIA) and enzyme-linked immunosorbent 
assay (EUSA ). RIA-P uses an anU-EPO polyclonal antibody raised 
in rabbits and has been described previously." RIA-N'^ uses a rabbit 
polyclonal antibody raised against a synthetic peptide (amino acids 
I through 20 of human EPO). This assay appears to be largely 
independent of folding of rHuEPO. The ELISA assays used immobi- 
lized MoAbs (F12 or Dl 1) as capture antibodies, A secondary anti- 
body raised in rabbits (R) or Goats (G) was labeled with horse radish 
peroxidase and was used as a signal antibody. RIA assays with 
9G8A and EUSA assays with FI2 (EIA/F12) and Dl 1 (EIA/Dl I ) 
arc described in an accompanying report." 

Epitope mapping of 9G8A with peptides. Synthetic EPO pep- 
tides, in 0-05 mol/L carbonate/bicarbonate pH 9.2 buffer, were bound 
to polystyrene wells (Costar, Cambridge, MA) at room temperature 
(RT) for 2 hours. The plate was incubated at RT for 2 hours then 
overnight at 4*C. The next day the wells were blocked for 30 minutes 
at RT with phosphate-buffered saline (PBS) (10 mmol/L Na - P04/ 
150 mmol/L NaCI pH 6.8) containing 5% bovine senim albumin 
(BSA) followed by a 2-hour RT incubation with protein A agarose 
purified 9G8A at a concentration of 5 ug/mL in PBS containing 1% 
BSA. After washing with KPL washing buffer (Kirkegard and Peny 
Labs, Inc Gaithersburg, MD) the plate was incubated with a 1:1,000 
dilution of goat-antimouse IgG conjugated with horse radish peroxi- 
dase (Boehringer Mannheim, Indianapolis, IN) for an hour at RT. 
The plate was then washed and developed with ABTS (2,2'-azino- 
di-[3-ethylbenzthiazoline sulfonate]) (Kirkegard and Perry Labs, Inc) 
substrate solution. Colorimetry was conducted at 405 nmol/L. 

Competition a.ssays with synthetic peptides used for 9G8A epitope 
mapping were performed by immobilizing 50 ng EPO in carbonate 
buffer for 2 hours at RT to polystyrene wells. The contents were 
then dumped and the wells were blocked for 1 hour with PBS/5% 
BSA. After removal of (he contents, 50 pL peptide and 50 9G8A 
(100 ng/niL) were simultaneously added. Incubation was for 3 hours 
at 4''C. The wells were then washed with KPL wash buffer, incubated 
with '^^1 labeled goat-antimouse antibodies, washed again, and 
counted. 

Construction of EPO variants. All rHuEPO variants were ex- 
pressed from an human EPO cDNA clone'* in an S vector.' To 
construct rHuEPO variants, a DNA segment containing EPO coding 
sequences was transferred to MI3mpl8'^ and single stranded DNA 
was produced in a dut ung Escherichia coli strain (RZI032) as 
described.'* *' The single-stranded DNA was annealed with synthetic 
oligonucleotides containing the desired mutations and then was ex- 
tended with Klenow polymerase in the presence of nucleotide tri- 
phosphates and T4 DNA ligase. After an overnight incubation, the 
reaction mixture was transfomned into wild type E coli (JMI09;"). 
Transfcctants were screened with the mutant oligonucleotides to 
identify mutants. The presence of each mutation was confirmed by 
DNA sequence analysis then the DNA segment containing the de- 
sired mutation was transferred back to the original SV-40 vector. 

Transfection of COS- 1 cells. COS- 1 cells were Iransfecled by 
one of two nwthods. The calcium phosphate method has been de- 
scribed previously.*^ To transform by clcctroporaiion, cells were 
procured from scmiconfluent dishes, washed in Dulbecco*s modified 
Eagle's medium (5% fetal calf serum, 1% L-glutaminc/penicillin/ 
streptomycin. Irvine Scientific, Irvine. OA) and resuspended ai 4 x 
10* cells per mL. One mL of cclb was transferred to an electropora- 
tion cuvette, and elcctroporatcd at 25 /ifarads and 1,600 V in the 
presence of 2 through 20 /ig plasndd DNA. The clectroponited cells 
were then plated at 2 x 10* cells per 60mm tissue culture dish in 5 



mL of medium. Two to 4 hours later the medium was removed and 
replaced with 5 mL fresh medium. With both u^sfeciton methods 
the conditioned medium (approximately 5 mL) was collected after 
3 to 5 days. Aliquots were made and stored at -l(fC. 

RESULTS 

Construction of rHuEPO variants for epitope mapping. 
To identify amino acids involved in antibody binding we 
constructed 207 rHuEPO variants with individual point mu- 
tations that altered 131 of the 165 amino acid positions in 
rHuEPO as well as 2 positions in the signal peptide (Figs 1 
and 2). The largest space between mutations was 3 amino 
acids. At 56 anuno acid positions, 2 or more variants with 
different substitutions were made. In general, the mutations 
were nonconscn'ative substitutions. This was done to ensure 
that if a region was important for antibody binding or 
rHuEPO structure, the mutation would result in an effect on 
antibody immunoreactivity. In cases where the nonconscrva- 
tive substitution was thought to affect protein foldings addi- 
tional conservative substitutions were also made. With this 
approach we hoped to identify structural elements in 
rHuEPO by distinguishing between regions that could and 
could not tolerate nonconservative substitutions and still 
have confidence that we could identify antibody epitopes. 

Expression of rHuEPO variants and determination of 
concentration. Because of the large number of rHuEPO 
variants needed for epitope mapping and the desire to in- 
crease the probability that proteins would be properly folded, 
we decided to express the variants transiently in COS cells 
and collect and assay conditioned medium containing se- 
creted rHuEPO. In many cases several independent transfec- 
tions for each rHuETO variant were done. Because of the 
low expression levels in this system as well as the large 
numbers of variants, it was not practical to purify each vari- 
ant. We chose instead to determine the concentration of 
rHuEPO in each sample directly by immunoassay. 

The concentration of each rHuEPO variant in supematants 
was determined by five different immunoassays including 
RIA-P. RIA-N. EIA/Dl I. EIA/F12. and RL\-9G8A immu- 
noassays. ELISA assays were performed using Dl 1 and F12 
as primary MoAbs in combination with rabbit (R) and. in 
some cases, goat (G) horseradish peroxidase conjugated sig- 
nal polyclonal antibodies." The use of two different second- 
ary antibodies raised in different species reduced the likeli- 
hood that low immunoreactivity in ELISAs was because of 
reduced inumunoreaciivity by the secondary antibody. The 
redundancy in assays allowed increased confidence that at 
least one assay could be found that was largely unaffected 
by the mutation. The fact that 9G8A, F12, and Dl 1 all recog- 
nized nonovcrlapping epitopes suggested that a rHuEPO 
variant with a mutation that affected the epitope for one 
antibody would not reduce the immunoreactivity of the other 
immunoassays. This would be especially true for mutations 
having little or no effect on rHuO*0 conforraalion. In addi- 
tion two different RIA assays using polyclonals were also 
peifoFraed. RIA-P uses a polyclonal antibody raised to 
rHuEPO, thus most rHuEPO mutations would be unlikely 
to directly affect immunoreactivity with this assay. RIA-N 
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Fig 1. Fine structure epitope 
mapping of anti-EPO MoAb F12. 
Amino acid residues in the hu- 
man EPO sequence (gray tetters) 
and EPO variants (adjacent to 
human sequences (black let- 
tersl) are indicated. Mutations 
connected by a line indicate that 
both were made simultaneously. 
Branched symbols at amino 
acids 24, 38, 83, and 126 repre- 
sent the locations of glycosyta- 
tion sites. The shape of the sym- 
bols surrounding the mutations 
indicate the effect on 9G8A im- 
munoreactivity. Shaded sym- 
bols indicate mutations that 
result in decreased F12 tmmuno- 
reactivitY. Immunoreactivity was 
determined as described in the 
text. 



u.'^ed a polyclonal antibody raised to an amino terminal (aa 
1-20) HuEPO peptide. This assay is largely unaffected by 
rHuEPO conformation and. in most cases, estimates of EPO 
concentrations determined by this as.say arc likely to reflect 
true concentrations. 

It is likely thai many mutations in rHuEPO would alter 
the folding. Furthermore the fact that both F12 and Dll 
recognize conformational epitopes suggests thai some 



rHuEPO mutations would indirectly affect the immunoreac- 
tivity to these monoclonals. To identify such mutations, sev- 
eral analyses were performed. First, the concentration of 
rHuEPO in supernatant was measured by an assay thought 
to be unaffected by ihe mutation to sec if the secretion level 
might be lower than expected. Reduced secretion is thought 
to be indicative of altered or incomplete folding. In numerous 
experiments with native, sequence rHuEPO we found that 
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Ftg 2. Fine structure epitope 
mapping of antt-EPO MoAb D11. 
Amino acid residues in the hu- 
man EPO sequence (gray letters! 
and EPO variants (adjacent to 
human sequences (black let- 
ters]] are indicated. Mutations 
connected by a tine indicate that 
both were made simuttaneously. 
Branched symbols at amino 
acids 24, 38, 83. and 126 repre- 
sent the locations of glycosyla- 
tion sites. The shape of the sym- 
bols surrounding the mutations 
indicate the effect on 9G8A im- 
munoreactivity (fold increase 
relative to RIA-N and or other im- 
munoassays). Shaded symbols 
indicate mutations that result in 
decreased Oil immunoreactiv- 
ity. Immunoreactivfty was deter- 
mined as described in the text. 
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conditioned incdiuin from COS cells contained 10 to l(X) U/ 
niL rHuEPO. Therefore, levels less than 5 U/mL were 
thought to be indicative of altered folding and thus reduced 
secretion. Secondly, immunoassays thought to be affected 
by changes in rHuEPO conformation were examined. RIA-P 
can have ivduced apparent immunoreactivity whereas RIA- 



9G8A immunoreactivity has increased immunoreactivity 
when rHuEPO conformation is altered." In ca.ses where 
RIA-P was low and 9G8A was high, EIA/FI2-R and EIA/ 
Dl l-R were also often low. In these cases RIA-N immunore- 
activity wa.s usually much higher than with any other immu- 
noa.s.say. Thus, we concluded that rHuEPO conformation 
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Table 1. 



Immunoreactivitv 



EIA-Fl2t EIA-D11S 
RIA (U/mL)t (U/mL) (U/mLl 



Mutation 


WT AA 


P 


N 


9GaA 


R 


G 


R 


G 




wt 


33 


26 


31 


35 


35 


36 


39 


wt 




66 


60 


53 


73 


60 


70 


71 


pro-2* 


leu 


29 


<0.05 


57 


40 




26 




asn4 


arQ 


3 6 


2.3 


9 


3.8 


3.4 


3.4 


2.9 


serS 


leu 


6.2 


<1.1 


8.5 


10.7 




12.2 




ser7 


cys 


0 6 


12 


263 


0.2 


0.6 


0.4 




ala12 


leu 


1 8 


2.8 


12 


1.7 




1.7 


2.9 


8la13 


giu 


4.4 


1.9 


2.3 


4.7 


3.4 


3.8 


0.9 


ser 13 


giu 


12 6 


14.8 


<- ^ 


14.2 




1 1.1 




gln14 


arg 


19 






20 




20 




ile 15 


tv^ 


10 


12 


3 


1 \ 




3.3 


12 


ser 16 


leu 


« 


12.9 




12 2 




1 -) 




«er17 


leu 


8 -8 


1 1 .7 


3 8 


12 7 




12 4 






cys 


1.4 


2.8 


39 


<0.07 


<0.06 


1.7 


3.0 


ile32 


his 


10 


10 


10 


0,6 




n 


10 


pro33 


cys 


2.3 


12 


280 


<0.2 


<0.5 


2.8 


15 


Ber70 


leu 


<0.14 


0.38 


4 


<0.06 




<0.06 




tys72 


glu 


15 


10 


n 


17 


17 


<0.15 




ser79 


ata 


3.7 


2.8 


34 


2.9 




2.1 


2.6 


ser81 


leu 


9.6 


22 


290 


<0.0€ 




10.7 


10.3 


glySe 


gin 


47 


34 


172 


<0.06 




51 


37 


glulOO 


ser 


82 


63 


125 


84 


84 


0.7 


<1 


alalOS 


arg 


7.9 


6.9 


7 


8.2 




<0.06 




tie 104 


ser 


21 


18 


56 


24 




23 




series 


leu 


0.17 


1.9 


32 


0.1 




0.2 


<0.3 


gtulOe 


thr 


28 


31 


108 


27 




<0.06 




ite138 


phe 


5.6 


3.4 


76 


0.19 




4.8 


7.1 


val138 


phe 


1.6 


1.7 


7.0 


<0.07 




0.9 


1.3 


phe145 


tyr 


25 


25 


305 


23 




29 


38 


val148 


phe 


0.3 


2.5 


69 


<0.06 




0.3 




alalSI 


giy 


5 


24 


203 


6 




4 


8.6 


tie 156 


tyr 


6.5 


28 


468 


7.7 


10 


19 


20 



• Number represents amirK) acid position in rHuEPO. Pro'* is a mutation in 
the signal peptide. 

t RIA was performed as described in Materials and Methods using either (N) 
N-temiinal antibody (PI polyclonal antibody or (9G8A) MoAb. 

I ELISA with MoAb F12 were performed as described in Materials and Methods 
using either (R) Rabbit polyclonal secondary antibodies or {G) Goat polyclonal 
secondary antibodies. 

5 ELISA with MoAb Dl 1 were performed as described in Materials and Methods 
using either (R) Rabbit polyclonal secondary antibodies or iG) Goat polyclonal 
secondary antibodies. 



was altered. RIA-9G8A/total ratios of 0.7 to 2.5 were found 
with samples containing folded rHuEPO and ratios of 20 
through 40 correlated with completely denatured rHuEPO, 
Total antigen concentration was calculated for each 
rHuEPO variant and was the average of all the assays that 
were unaffected by the mutations. Ratios were then calcu- 
lated for each inimunoassay; immunoreactivity measured 
with each assay divided by the total antigen concentration. 
As an example of this analysis a Cys" to pro substitudon 
resulted in secretion of 12 U/tnL as measured by RIA-N and 
15 U/mL as measured by EIA/Dll-G (Table 1). Therefore 
we assume the rHuEPO concentration was 12 to 15 U/mL. 
This concentration is reasonable because it was in the normal 
range. RIA-P immunoreactivity was 5X lower and 9G8A 
immunoreactivity was 23x higher suggesting thai rHuEPO 
conformation is altered with this mutotion. This is not unex- 
pected because Cys^ is pan of a disulfide bond that may be 
important for rHuEPO structure. F12 immunorcaciivity was 
low suggesting that this assay is also affected by the muta- 



tion. Immunoreactivity measured by EIA/Dl 1-R was lower 
than that of EIA/D 1 1 -G. This suggests thai monoclonal D 1 1 
and the goat secondary antibody were capable of binding to 
this rHuEPO variant but the immunoreactivity measured by 
the rabbit secondary antibody was reduced. Using this type 
of analysis, we could determine both the concentration of 
each variant in samples and the degree to which each muta- 
tion affected immunoreactivity and the conformation of 
rHuEPO. Reductions of 30% or more in apparent F12 or 
DU immunoreactivity were considered significant. Muta- 
tions thought to change rHuEPO conformation were defintrd 
as those that increased 9G8A immunoreactivity by more than 
2.5 times expected values. 

Analysis of RlA-N immunoreaaivixy. The antibody used 
in RIA-N was raised against a peptide containing the fir^t 
20 amino acids of HuEPO.'^ It appeared to react normally 
to all the rHuEPO variants with mutations between 21 and 
166 because immunoreactivity was always equal to or higher 
than immunoreactivitics measured with all other immunoas- 
says except RLA-9G8A. Although the antibody in RIA-N is 
a polyclonal it is possible that rHuEPO variants with muta- 
tions in the amino terminal 20 amino acids may affect RIA- 
N immunoreactivity. Mutations at amino acid residues 6. 7, 
8. 10. 1 1, 12, 14, 15, 16, 17, 18, and 20 all gave equal or 
higher EPO concentrations measured with RIA-N, to thai 
obtained with all other immunoa.ssays except R1A-9G8A. 
However, compared to some other immunoassays a Ser^ 
substitution reduced inrununoreactiviiy over 10-fold and Asn"* 
and Asn** mutations reduced immunoreactivity approxi- 
mately 30 to 35%. An Ala'-^ substitution reduced RIA-N 
immunoreactivity approximately twofold whereas a Ser'^ 
mutation had no apparent effect. It has been reported else- 
where that immunoreactivity to monkey EPO with RIA-N 
is reduced 150-fold." Monkey EPO is extended by 3 amino 
acids (V-P-G) because of altered processing of the signal 
peptide.'^' presumably because of the pro residue at -2. Mon- 
key EPO differs at one other position in the first 20 amino 
acid residues, position 16. However, a rHuEPO Leu'" to Ser 
variant appears to be recognized normally by RIA-N (Table 
I). To determine whether alterations at the amino terminus 
explain the reduced immunoreactivity to monkey EPO, a 
rHuEPO variant with a pro substitution at -2 was con- 
structed and assayed. Immunoreactivity with RIA-N was re- 
duced over 500-fold with this variant (Table I ). The apparent 
immunoreactivity using other immunoassays suggested that 
this mutant was secreted normally (26 to 40 U/mL). In addi- 
tion, the immunoreactivity with 9G8A was also in this range 
59 U/mL. These results suggest that this protein is secreted 
and folded normally. A different substitution^ Leu to Val at 
- n , in the signal peptide had no apparent effect on immuno- 
reactivity with RIA-N (dau not shown). These results sug- 
gest that processing of the signal peptide may be altered by 
a pro substitution at -2 and this alteration affects RIA-N 
immunoreactivity. Thus, the only mutations that dramati- 
cally affected RIA-N (> 10-foId decrease) were at the amino 
terminus and position 5. Residues C-terminal to position 5 
appeared to affect RIA-N modestly (twofold or less). This 
suggests that the major epitope for RIA-N is contained 
within the first 5 residues. 
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Fig 3. Epitope mapping of anti-EPO MoAb 9G8A 
with peptides. The indicated peptides at the concen- 
trations shown were bound to plates then incubated 
with 9G8A and '"I labeled antimousc IgG. Bound 
''^l was determined as described in Materials and 
Methods. 
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Mappinii of the 9GHA epitope. The observation lhat 
9GSA can recognize EPO on Western blots and would react 
with denatured EPO in RIAs suggested that 9G8A recog- 
nized a linear epitope. Therefore, we measured 9G8A bind- 
ing to .synihelic peptides with the hope lhat we could deter- 
mine ihe apprt).\iniate KK*ation of ihe epitope. A collection of 
overlapping synihelic EPO peplitles were bound to microliter 
plates and then 9G8A was added lo see if any specific anti- 
body binding could be observed as described in Materials 
and Methods. One (x;piide that included amino acids 1 
through 20 gave a clear positive signal {Fig 3). In this experi- 
ment the maximum signal was obtained even with the lowest 
amount of peptide. All the other peptides resulted in low 
levels of binding including a peptide containing residues 10 
through 28. To further substantiate the role of the amino 
terminal region in 9G8A immunorcaeiiviiy. the ability of the 
l>eptides to inhibit binding of rHuEPO in RIA-9G8A was 
measured. As shown in Fig 4 t>nly the peptide containing 
amino acid residues 1 through 20 would inhibit rHuEPO 
binding and it did so in a dose dependent manner. These 
results suggest that residues between I and 20 contain the 
9G8A epitope. 



To localize the 9G8A epitope further the 2()5 rHuEPO 
variants shown in Figs I and 2 were examined lo see which 
liad reduced 9G8A immunoreaciivity. As expected no muta- 
tions in residues between 21 and 166 reduced 9G8A imniu- 
noreactiviiy relative to other assays. However. 3 positions 
in the I through 20 region appeared to be imponani for 
inimunoreactivity including Glu'\ Leu'", and Leu'^ (Table 
1, Fig 5). The Ala" mutation resulted in a twofold decrease. 
Ser'-"io over 10-fold decrea.se. Ala'" to a 10-fold. Ser'" lo 
an eightfold, and Ala*' and Ser'' lo ihreefold decreases in 
apparent 9G8A inimunoreactivity. Mouse EPO differs from 
human EPO only at po.siiion 16 in the tirst 20 residues and 
9G8A immunoreaciivity lo mouse EPO is reduced approxi- 
mately 500-fold." This is consistent with the suggestion that 
the major epitope for 9G8A includes residues 13 lo 17. 

Fine structure epitope mappiiif^ of Mo Ah ri2. Results 
from the accompanying report" indicate that MoAbs Dll 
and F12 recognize conformational epitopes. Thus, atlempis 
to map the epitopes with .synthetic peptides were unsuccess- 
ful. We cho.se instead to determine the antibody immunore- 
aciivity to closely related rHuEPO variants using the strategy 
previously desciibed. Mutations at 21 of ihc 131 amino acid 



300 



Fig 4. InhibHion of rHuEPO binding in 9G8A im- 
munoasuy by synth«tic HuEPO peptides. R1A-9G8A 
was performed by Immobilizing rHuEPO to a microti- 
tar Plata. Ttia plates ware then Incubated with the 
indicated amounts of EPO peptides and 9G8A. The 
relative amount of 9G8A subsequently bound (CPM) 
to plates was determined by a final incubation with 
ill ^labeled goat antinwuse IgG as described In Mate- 
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rials and Methods. EPO Peptides were aa 1 through 
20 (■). 41 through 57 tOK 56 through 80 (♦), and 
144 through 166 (4). 
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Fig 5. Helical wheel presentation of amino acids 62 to 79 (HELIX 
B). Amino acid residues (closest to the circle! represent those found 
in the human EPO sequence. Arrows point towards mutations (in 
t>oxe$). Mutations that reduced D11 immunoreactivity are shaded as 
indicated. The effect of each mutation on 9G8A immunoreactivity i* 
shown in parentheses. 9G8A immunoreactivity values are shown as 
percent of total antigen present as determined by immunoassay (see 
text). The predicted buried and exposed surfaces of the helix are 
shown. 



positions in rHuEPO tested rcsiiltctl in reduced FI2 imnuino- 
reactiviiy. Examples that show reduced FI2 immunoicacliv- 
iiy are shown in Table I. These and other mutations that 
caused reduced FI2 immunoreactivity are indicated in Fii! 
I . The mutations that affected Fl 2 immunoreactivity inappcd 
to amino acid residues 7. 29 to 3.^. 42. 7(1 XI. 86 to S9, 91. 
97. lOri. 1.^7. I3S. 148. :md \56 and 161. 

Several of the mapped positions are unlikely to be pan of 
the FI2 cpilo|>e. Residues 7 and 161 make up a disulfide 
bond imix>rtanl for stmcture and activity." The mutiuions at 
Pro-*-, Leu'", Leu*", Lys^'. Thr'*' and Phe'^** and Tyr''" re- 
sulted in high 9G8A immunoreactivity, low RIA-P immuno- 
reaciivity and, for most, were secreted poorly. This suggests 
that muialions in these residues afl'ected protein folding and 
therefore the.se positions may not be pan of the FI2 epitope. 
Variants with mutations at Leu"*' and Phc'^" also had high 
9G8A immunoreactivity suggesting that rHuEPO structure 
was affected by the mutations. However, Val"*" and lie'*'' 
variants were recognized similarly by RIA-N. RIA-P. El A/ 
DM. and the Scr*' variant was secreted at nonnal levels. 
Thus mutations in these positions affect rHuEPO struct u a* 
somewhat. However, immunoreaclivily was reducetl .^O-fold 
or more with Fl 2 which suggests that residues 81 and l.'^S are 
cU>se lo or pan of the epitope. In support of this possibility is 
the observation that other mutations affected folding (over 
lO-fold increases in 9G8A immunoreactivity) but they did 
not affect FI2 immunoreactivity eg, Ser\ Val^\ Ile^^ and 
Ile'^\ Mutations in the other two regions with low FI2 im- 



iiumorcactiviiy. amino acids .^I. .^2. and 87. 88, had nornial 
RIA-P. DM, and 9G8A inimunorcacii\ iiy. Thus, these icsi- 
dues appear to be directly involved in 1*12 binding. Residues 
29 and 33 make up a disullide bond. iMu tat ions in these 
residues result in reduced FI2 immunoreactivity but also 
have low RIA-P and increased 9G8A immunoreactivity su di- 
gesting an effect on folding. Given I lie closeness to residues 
thought to interact directly with F12 (31 and 32) we cannoi 
rule out their interaction with FI2 as well. 

/•■///(' stmcture <'pit(ff>e nutppin^i <\i 1)1 1. The EPO vari- 
ants were al.so tested as previously described for Dl 1 ininui- 
noreaciivity to identify its epitope. Mutations at 21 of the 
131 amiiui acid positions tested resulted in decreased appar- 
ent Dl 1 imnuMioreaetiviiy (Table 1 and Fii: 2). These wore 
locali/od to two main rei:i<ins: aminn acitls 64 ihrouyh 8 1 
and 96 through 110. A Ser muiatit>n at Cys"" and a Thr 
mutation at Leu*" also affected i:>l 1 immunoreactivity." 
However 9G8A immunoreactivity was high and RIA-P was 
h)w, Cys"'' forms a disullide bt>nd with Cys'. Thus, muta- 
tions at both positions should result in decreased immunore- 
activity with Dl I if the decrease was because of disruption 
of the disullide bond. Howexer. a Cys to ,Scr mutation ap- 
pears not to affect Dl l-G (goal secondary aniilxKly ) immu- 
noreactivity but Dl l-G immuni)reaetiviiy is reduced at least 
20- fold with Cys'**' mtiialions." This suggests that Cys'"' 
may be clo.se to or pan of the DM epitope whereas Cys is 
not. Alternatively, the Ser""' mutation may be more destabi- 
lizing than the Ser' mutation. In support of the latter possibil- 
ity is the obscr\'ation that the Ser" ' variant is consistently 
.secreted at much lower levels than the Ser^ variant." 

Identification of amim* acnis that may he pan of a h\ - 
(htf phobic core. Certain mutations in rHuEPO appeared to 
severely affect structure as evidenced by three criteria; poor 
secretion, low RIA-P immumireaetiviiy. and 10-fold or 
greater than expected 9GSA immunoreactivity. These muta- 
tions also resulted in low in vitro bioactivity (tiata not 
shown). The residues with this properly include Asn^\ Leu" . 
Leu'". Leu'"' . Phe"\ Lys'- - and Tyr"''. These positions con- 
tained mostly hydrophobic residues and substitutions of j^o- 
lar residues had much greater effects than did substitutions 
with hydrophobic residues. For example, a Leu^" to lie muta- 
tion did not affect RIA-P, 9G8A. or secretion levels but 
mutations to Ala or Ser did (Figs I. 2. and 5), We conclude 
that these positions may be important for overall structure 
or be part of the hydrophobic core. 

Identification of regions containin}* tx -helices and con- 
necting loops. It has been suggested that mulatit^ns in (t- 
helices alter secondary and tertiary structure while mutations 
in connecting Unips are tolerated.'-" We hypoihesi/ed thai 
we may be able to identify such regions according to effects 
of the mutations on rHuEPO folding as measured by 9G8A 
immunoa.s.say. Most regions contained mutations that af- 
fected 9G8A immunoreactivity (sec Figs I or 2). However, 
there was one stretch of amino acids, 1 14 through 1 34, where 
mutations appeared to have little effect on 9G8A immunorc- 
activity. Thus, residues 1 14 through 1 34 may be a connecting 
loop unimportant for overall structure. 

We decided to also look for helices because of the a-port 
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Fig 6. Helical wheel presentation of amino acids 94 to 111 (HELIX 
C|. Amino acid residues (closest to the circle) represent those found 
in the human EPO sequence. Arrows point towards mutations (in 
boxes). Mutations that reduced D11 immunoreactlvity are shaded as 
indicated. The effect of each mutation on 9G8A immunoreactivity is 
shown in parentheses. 9G8A immunoreactivity values are shown as 
percent of total antigen present as determined by immunoassay (see 
text). The predicted buried and exposed surfaces of the helix are 
shown. 



(.liclcJ .solvent exposed re<:ion did iioi result in increases in 
^KiSA inifiunuireaetiviiy. These results are eonsistcnt witli 
a-helieal siriieuire Ibr ihe.se three regions. 

Sup|X)rt t\)r the existence Helix B (aas 62-81) conios 
from the Dl I epitope mapping data (Figs 2 and 5). Residues 
directly contacting the DI I antibody should be on the cx- 
po.sed surface of rHuRPO and mutations in them should 
result in large decreases in immunoreactivity. In contrast, if 
residues in a helix are buried but near lo residues that are 
part of the epitope then the immunoreactivity to variants 
containing these mutations should decrease but to a lesser 
extent because of nDnspecilic effects on secontlary and ter- 
tiary stiitciure. lixamination Helix B shows that the puta- 
tive exposeii surface contains the residues that appear to be 
inosl iniptinmi! for O! I imnuinorcacli viiy. Trp''\ Cilii'- and 
Leu'\ Mutations in these residues resulted in 10- fold, 25- 
fold, and 3()-fold decreases, respectively, in Dl I immunore- 
activity, but had normal 9G8A immunoreactivity. Thus, 
these three amino acids may be directly involved in Dl 1 
binding and are therefore likely lo be on the exposed surface 
of the molecule. Mulatitins in residues on the putative buried 
surface also resulted in decreased Dl 1 immunoreactivity but 
ti) a lesser degree. In addition, the DM immunoreactivity 
was inversely related to 9G8A immunoreactivity suggesting 
that the decreased DM iinmunoreactiviiy is a consequence 
of unfolding of rHuEPO. For example see residues 67. 70. 
and 74. 

Additional analysis of helix B allows prediction of po.ssi- 
ble amino terminal and carboxy tertninal helix caps. Resi- 



ihat rhuEPO i.s %Yf< a-helix.' Wc reasoned that helical re- 
gions in rHuEPO would be aiiiphipalhic. That is, there would 
be buried and exposed surfaces on the helices. Therefore, 
mutations in residues t>n the buried surface would likely 
affect rHuEPO structure whereas similar mutations on the 
exposed surface would not. This would be especially true 
Ibr inulaiions thai introduced pt^lar residues into buried posi- 
iit)ns. The elTccl of mutations on rHuEPO structure was 
monitored by 9G8A immunoreactivity as previously de- 
scril>ed. 

The entire length of rHuEPO was examined by plotting 
the immunoassay data for rHuEPO variants on helical 
wheels." The plots were then examined lo identify regions 
that conccmrated mutations thai resulted in high 9G8A im- 
munoreactivity on one side of the wheel. Tliree potential 
helices were idenlilied using this method (residues 62 
through KL % through I U) and 144 through 157: Figs 5, 6. 
and 7). 

A properly of amphipathic helices is that the buried sur- 
face contains pi-edominanlly hydrophobic residues and the 
exposed surface contains predominantly hydrophillic resi- 
dues. As shown in Figs 5. 6, and 7. each of the ihree regions 
containing the putative helices can Ik divided into surfaces 
coniaining predominantly polar or hydrophobic residues. 
Furthcnnore, the mutations in residues on the hydrophobic 
side of each wheel resulted in increases in 9G8A immunore- 
activity. In contrast, polar to nonpolar mutations in ihc pre- 
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Fig 7. Helkat wheel presentation of amino acids 140 to 157 (HEUX 
D». Amino add residues tclosest to the cirdel represent those found 
in the human EPO sequence. Arrows point towards mutations (in 
boxes!. The effect of each muUtion on 9G8A immunoreacthrity is 
shown in parentheses. 9G8A immunoreacth^ values are shown as 
percent of toUl antigen present as determined by Immunoassay (see 
text). The predicted buried and exposed surfaces of the helix are 
sho%vn. 
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iltk> aiDin.* tLTiiiin;il l.' ivst<!ik- 5S anJ Liirhoxy UTniin:il lo 

losi* ;iiMphip;tiIiK- siidcuiiv ;i».\ni\lini: lo lK-lic;il 
plots ol'^KiSA ininuiiiiMCJt'ltv ii\ , l"»'f CN.iinplc. it* the Ik'lix 
o\tciuiL-tl IvvotuI Ari: ihcn Al;i ' i> prciliclctl u> he dii iIk- 
Miii;icc. Iln\\v.'\cr. imnKliiciioii a Scr ivslclm- ai 
po.silioM I'i results in irKTca>cJ *KiSA iinimmorcacUvit\ (Ta- 
hk" I ». Iti an\ ca.sL'. it is unlikely that I Ik- \\cU \ oxtcnJs ho\t>tul 
Ai j: " hoeauso ivl the (Uy at 77. Glycine residues arc thought 
to lU'stahili/c helices :itul are (»riL-n ("oinul at helix ciuls.'* In 
atltiition. pt»Niti\el\ eharj:eil residues are ot'lcn \\n\iM\ near 
the u-lieli\ C"-ierininu> atui aeitlie residues are <it'ien loutul 
near the anuno terminus hecause ot their stahili/Jtig ertect 
on the helix vlipole. * This uouUI he consistent witli helix 
caps ;ii ciilici C ilu .^s oi ( ilu' aiul a ( icnnin.il cap ;ii Ai ^iK^ 

,\^ s\u>\\ )i ill i ie '» icstiinc^ itcr-^ fcn .liui I i'' nls" 'ia\ 
a helical uiicel paiicrn coiisisicni uiili ati n -helix. As uiili 
helix H. inuiaiions iluit rcMih in iticreaseil *K'iSA innmniDrc- 
aclivily are concentraieii on tuie sitle ot' helix (". A Gic ai 
l-msiiion I 12 suiii:esis tliai the helix stops prior to this posi- 
tion. Thus. Ari;"" is a possihie C'-terniinal helix cap. A possi- 
hle amino terminal helix ca(i is Asp". Helix caps at Asp"'* 
and Arjj"" \v*niU! result in a helix (he same leniiih as helix 
H. In <up(>i>rt ol thi^ proposed helix is the ohscrvaiiun iliai 
Ias" is thou*:hl to iv pariially hurled hecause ii reacts. bu\ 
p(H»rl\. v\ ith reauents thai react with L\s amino iiroups.' The 
')(IS.\ immunoassav results (Fiu fit suiZiiest that this residue 
nia\ be at the huried/surt'aee interface. The Dl I epitope 
majipini: results are aKo consisieni u ith a helix in this resjion. 
As w ith helix Ii the mutations that had the ijreaiest elTeel on 
iminunoieaciivity are concentrated on the putative exposed 
surface (Scr' Ari:'"\ riir'"", and Arj;""). Mutations in these 
amino acids result hi over l(M)-rold decreases in Dl 1 iiiimi]- 
noreacti\ ity. The exception is nuKations at I.eu'"\ However. 
Variants with mutations al Leu'"' are secreted poorly and 
have itnv RIA-T immunoreacti viiy suii^^esiini! that the muta- 
tions at ihi^ position altect loldiui: (ei:. ^ee Ser Tahle 1). 
()iher niMtaiions in (he pniatiw buried surface alsii atteci 
1)11 innnunoreacii\ it\ . lloucNcr. in all cases these muta- 
tions arc in hydrophobic amino acids and they result in in- 
creasetl *KiSA immuiu^reaclivity . Thcret'ore it is likely that 
the reduce^! imnuinoreaclivity is because of nt>nsix.'cific el- 
I'ecls <»n rlluIiPC) foldini:. 

Residues 144 to 157 also have a helical wheel palicrn 
consisicnt with an /r-helix. In u manner sitnilar to helices B 
and C, mutations thai incroiise 9GSA inimunoreaciiviiy are 
coneeniraied on the hydrophobic side of the helix. There is 
one chari:ed residue on the predicted buried surface. I.ys 
Mutatitins in l.ys'^* resulted in increased WSA immunoreac- 
tiviiy however (l-ij: 7). In addiiittn this residue is thou j: tit 
lo bo buried aceordini; to its po<»r reactivity with chemical 
nuHlilVini: reagents." '^ The location of the helix caps is un- 
clear. However il is unlikely that the lielix extends beyond 
(lly'*^ a helix breakini; residue. Ptissible amino terminal 
ends are Lys'*" and Val*". A helix cap siariiiitJ at Val'" 
would result in a helix the same length as (hose predicted 
iiv helices B and C II is unlikely thai the helix begins before 
a'sidue I3H because the F12 epitope mapping data suggests 
lhai I3S is at i»r near to the FI2 epitope. If ihe helix began 
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Fig 8. Helical wheel presentation of amino acids 5 to 23 (HELIX 
A). Amino acid residues (closest to the circle! represent those found 
in the human EPO sequence. Arrows point towards mutations (in 
boxes). The effect of each mutation on 9G8A immunoreactivity is 
shown in parentheses. 9G8A immunoreactivitv values are shown as 
percent of total antigen present as determined by immunoassay (see 
text). The predicted buried and exposed surfaces of the helix are 
shown. 



before I3S it would not be po.s^-ihk* tit position tlii.s residue 
near to the tJlher residues thouiiln to be pan of the epitope, 
residues ,^2 and SS. 

Human EK) has been reponed to fall into the 4-heli.\ 
bundle eategory of eytokine folds." '' Therefore there .should 
be a fouilh, amino terminal, heli.v. We plotted ili is region on 
a helieal wheel (Fii; S). Hydrophobic residues are generally 
eoneentraletl on one >ide of Ihe potential heli.v: Leu\ Me", 
t.eu'". l..eu"\ and l.eu' . Howexer. a helieal wheel plot dt»e> 
iK>i eoncenirate muialiinis lhat re>ull in itieiea.sed *^XiHA ini- 
munoreaetiviiy on any one side. A eomplieaiion with this 
analysis is thai the 9GSA epiti>pe maps to ihis region. In 
facL rHuL:l*0 with muialions in some of the putative buried 
residues. Glu'\ Leu'", and Leu' . all have redueed apparent 
<XiSA immuntireaelivity (previously deseribed). It these resi- 
dues arc indeed buried, it would explain the enhaneed inimu- 
noreaeiivity to rHuEPO with 9GXA upon unfolding. Iliere 
is support for a heli.x in this region as well as |Hissible orienta- 
tion of its ex|Tosed and buried surtaees. Tyr' is reiitlily iodin- 
aled NUgi:esiing its surfaee loealitm-.^ In addition. Arg'* is 
thought to be part of the aeti\e site. " This would put this 
residue on the surface as well. Finally, iwo mutations on 
the predieled buried surfaee. Leu'- to Ala and Ala'" lo Ser 
mutations resulted in high 9GKA imnuinoreaetiviiy sug- 
gesting that they are buried. Possible helix caps are Arg\ or 
Asp^. and l-ys*"". 

A modvl consistent w ith t/w antihoJy cftihtfn' mapinu}* is 
a 4'helix hundU', We have develojKd a model of rhuEK) 
siruelurc using the data reported here. The following eon- 
straints were used. Residues 7 and 161 were joined by a 
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Fig 9. Modeling of EPO structure. The model is based on informa- 
tion described in the text. Arrows oriented in the direction shown 
represent potential n-hefices labeled A, B, C, and D. Lines represent 
connecting loops. Numbers represent locations of the indicated 
amino acid residues where 1 is the amino terminus of rHuEPO follow- 
ing removal of the signal peptide. Disulfide bonds are indicated IC- 
Cl. 

Ji.sultKk' b*nKi. Residues 29 to .^3 were assumed to eonuuti 
;i lurn heeause ilic Cys'^'-C ys" bond. Ki2 epilojx' niappini: 
results su*ii:e>teil lhai residues 29 through 33. S6 throuiih 
SS. and I3S uere posiiioiied eloso to eaeh oiher. Dl 1 epitope 
niappini! resulis suggested ihai residues 64. 72. 75. KM). 103. 
and 106 were also close lo each olher. 90 S A mappini: results 
indiealed that iis cpiu>i>e includes residues 13. 16. and 17. 
The results IVoni the accompanying paper" suggested that 
MoAbs J' 1 2. DM. and 9G>^A recogni/e nonovcrlapping epi- 
topes. Thorelorc residues in each of the three epitopes were 
separated. This data allowed prediction of an overall protein 
fold. We then superimposed on the model the predicted heli- 
ces (S tha)ugli 2(». 62 through 76. 96 ihrougli I H). and 144 
through 157) and connecting loop (I 14 through 134). 

The model of rHuIiPO structure is shown in Fig 9. This 
structure includes 4 closely packed helices labeled A. B. C 
and D with a up-up-down-down helix topi>logy. Long l<K)ps. 
33 through 62 and I 10 through 140. connect helices A to B 



;nit] (' lo i). lespeciis el\'. I'lie icnuIucn iinpnriiini tor (Moiein 
st;ihiltl>'. 42. 70. 10."^. 14>s. I. "^2 ;mu1 INl arc posiuoiieil in 
.iiv;is con.sisiem with their being p;iri dl' a h^tlmphohie <oi\\ 

1 he 1)1 1 epitope included heliecN B and C. The 9CiS.\ epi- 
tope is in the A helix opposite lVt)in (he 1)1 t epitope, riie 
112 epitope includes the (xucniial (urns on (he botinni ol 
the sirucdire. 

DISCUSSION 

We describe here an immunoUigical apj^rojich to deicnni- 
nalion of rHuRPO structure. Crucial tt> this approach t> the 
av :iiiability (tt' individual rHuRPO point nuMations atul ilie 
;thili(y lo tletcrniine the effects of the niu(aiit>ns on toUlini: 
.lia! i[iununoj'eacii\'il\ . f wc JilTcrcnt in;:ii:m> ':l^^a^ - 
u'-eJ M'l lht> elloit. MicttKlinL: > uilieit^iu iHt iU'v!* 'iiai - .ii.vi 

2 dilterent polyclonals. In addition 2 ilillerent vecoiular) 
amibtKiies were used in ULISAs. The 3 luunoclonals recog- 
nized nonovcrlapping epito|X!S which reduced the |>ossibitily 
that nnilalions would reduce iinnuinoreactivity for all 3 si- 
nudiancously. The combination of all these assays residteil 
in a high probability that at least one could be \'oun<\ to 
cstiniaie rHulfiPO C(Miceniration. 

rhe predicted structure of rHui:-PO based on the epitt>pe 
n Kipping and 908 A tolding experiments is a 4 -helix bundle 
il iL: 9 J. There is other tlaia consistent with this uukIcI. I-irsl. 
mouse and human EPO sequences are highly conser\ed' 
arui both molecules stimulate human and mouse erythro|>i)ie- 
'i'liis suggests that the structures of the two molecules 
are similar, particularly in the receptor binding iegioM>. 
.Nhuisc fiPO has Cys'' to Pro and Arg'"' to Cys substitutions 
as compared to human l:PO, It has been proposed that in 
mouse HPO Cys'*' and Cys'*** form a disullide bond. The 
model described here positions residues 29 and !39 cK)se 
together. Second, amino acid residues 32 to 36. 53 lo 57. 7S 
to S2. and III to 119 can he deleted yei the protein can 
maintain its structure.' We have also found lhai residues in 
tlu: I 1 I through 139 region can be mutatetl wiih<nii destabi- 
lizing the structure. Ltxip regions can often tolerate insert ion ^ 
and deletions while helical and beta sheet regions tl<i 
no(.'"*'^ In agreement with our model, these amino acids are 
predicted to be part of either turns or interconnecting Imips. 
Third, crossi inking studies indicate that certain residues arc 
cli»se together.*' These include the amino tenninus and Lys"'*. 
Lys'' and Lys"", Lys'- and Lys"*\ and Lys- and Lys" In 
addition the p<Kir reactivity of Lys'^- suggests that it is bur- 
ied. These cn>ssl inked residues are all positioned relatively 
close to each other on our model. Finally, active site residues, 
Arg'\ Arg'"\ Ser"", and Gly"*' .arc positioned on pre- 
dicted surfaces and generally conceniraie in patches on the 
model. 

Several different models of EPO structure have been pro- 
posed elsewhere, all based on 4-helix bundles.''"' The mtxJels 
proposed by Ba/an.** Bois.sel et al.^ and Hanui el al" are all 
similar to the nnxiel proposed here. However they all differ 
in the location of helices and lums. Tlie Bazan model was 
developed according to structural homology to human 
growtli hormone. It predicts longer helices than those pro- 
ptKcd here. The Boisset m(xJel was developed using second- 
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ary struciure predictive methods and computer modeling. 
This model was consistent with deletion mutant data.^ Both 
models are consistent with some but not all of our data. For 
example, the Boissel model positions residue 138 distant 
from residues 32 and 88. Epitope mapping with MoAb FI2 
positions these regions cIo.se together. 

It has been reported that EPO neutralizing antibodies in- 
hibit binding of EPO to its receptor." The Dll antibody 
described here is neutralizing. This suggests that its epitope, 
residues 99 through 110 and 64 through 79, may include 
the receptor binding domain, Sytowski and Donahue-*" also 
identified neutralizing antibodies that were raised against 
synthetic peptides. This study showed that antibodies raised 
against peptides 99 through 118 and 111 through 129 would 
neutralize EPO activity. In vitro mutagenesis studies indicate 
further that the 100 through 110 region is important for 
biological activity but the 64 through 79 region is not.'*'"^' 
This is consistent with our observations. The 9G8A antibody 
epitope maps to residues 13, 16, and 17. It did not neutralize 
rHuEPO activity." Sytowski et al also raised antipeptide 
antibodies to residues 1 through 26 that were reported to be 
nonneuiralizing.^- In contrast Egrie et al'^ raised a neutraliz- 
ing antibody to an amino terminal (residues I through 20) 
peptide. In addition, Arg"* has been reported to be part of 
the active site." These results suggest a role for the amino 
terminal region in biological activity. Therefore, the N -termi- 
nal region active site residues involved in receptor binding 
must be positioned in such a way that they would not be 
stcarically hindered by 9G8A. Indeed, the 9G8A epitope 
may be on the buried surface of the helix. Another possibility 
is that the 9G8A antibody is unable to neutralize rHuEPO 
activity because the affinity of 9G8A to its epitope is too 
low to block receptor binding when rHuEPO is properly 
folded. Additional information on the location of the 
rHuEPO active site may resolve the issue. 

ACKNOWLEDGMENT 

Wc thank B. Aguero. J. Grant. D. Greene, and X. Wang for expert 
technical assistance: J. Egrie for technical assistance and helpful 
discussions. K. Chen, S. Suggs. A. Harcourte, J. Katzowitz, and Ann 
Janssen for DNA sequence analysis and T. Jones, E. Fisher and Burt 
Goodman for synthetic peptides and oligonucleotides. 

REFERENCES 

1. Lin F-K. Suggs S, Lin C-H, Browne JK, Snialling R, Egrie 
JC, Chen KK, Fox GM. Martin F, Siabinski Z, Badrawi SM. Lai 
P-H, and Goldwasscr E: Cloning and expression of the human eryth- 
ropoietin gene. Proc Natl Acad Sci USA 82:7580, 1985 

2. Jacobs K, Shoemaker C. Rudersdorf R. Neill SD, Kaufman 
RJ. Mufson A. Scchra J, Jones SS. Hewick R. Friisch EF, Kawakiia 
M, Shimizu T, Miyake T: Isolation and characterization of genonuc 
and cDNA clones of hunum erythropoietin. Nature 313:806. 1985 

3. Browne JK, Cohen AM. Egrie JC, Lai P-H, Un F-K, Strickland 
T, Watson E. Stebbing N: Erythropoietin: Gene cloning, protein 
structure, and biological properties. Cold Spring Harbor Symp Quant 
Biol 51:693. 1986 

4. Egrie JC, Strickland TS, Lane J. Aoki K, Cohen AM, SmalUng 
R, Trail G. Un F-K, Browne JK, Hines DK: Characterization and 
biological efTects of recombinant human erythropoietin. Immunobi- 
ology. 172:213, 1986 



5. Davis JM, Arakawa T. Suickland TW'. Yphanti>s DA: Charac- 
terization of recombinant human erythropoietin produced in Chinese 
hamster ovary cells. Biochemistry 26:263.'^. 1987 

6. Haniu M, Narhi LO. Arakawa T, Elliou S. Rohde MF. Recom- 
binant human erythropoietin (rHuEPO): Cross-linking with disuccin- 
imidyl esters and identification of the interfacing domains in EPO. 
Protein Sci 2:1441. 1993 

7. Boissel J-P, Lee WR, Presnell SR. Cohen FE, Bunn HF: Eryth- 
ropoietin structure -function relationships, mutant proteins thai test 
a model of tertiary structure. J Biol Chem 268:15983. 1993 

8. Goldwasser E: Structure- function relationships of erythropoie- 
tin, in Erslev AJ, Adamson JW, Eschbach JW. Wincarls CG (cds); 
Erythropoietin, Molecular. Cellular and Clinical Biology. Baltimore. 
MD, Johns Hopkins University Press, 1991. p 41 

9. Bazan JF: Haemopoietic receptors and helical cytokines. Im- 
munol Today 11:350, 1990 

10. Jin L. Cohen FE, Wells J A; Struciure iroin tunciiun: Screen- 
ing structural models with functional data. Proc Nail Acad Sci USA 
91:113. 1994 

11. Elliott S, Boone T. Chang D, Delorme E. Dunn C, Egrie J. 
Gifrtn J, Lorenzini A. Talbot C, Hesterberg L: Isolation and charac- 
terization of conformation sensitive ami-er>thropoietin monoclonal 
antibodies: Effect of disulfide bonds on rHuEPO structure. Blood 
87:2714, 1996 

12. Freimuth PI, Taylor JW, Kaiser ET: Introduction of guesi 
peptides into Escherichia coli alkaline phosphatase. J Biol Chem 
265:8%, 1990 

13. Pakula AA, Sauer RT: Genetic analysis of protein stability 
and function. Annu Rev Genet 23:289, 1989 

14. Bowie JU, Reidhaar-Olson JF, Lim WA. Sauer RT: Deci- 
phering the message in protein sequences: Tolerance to amino acid 
substitutions. Science 247:1306, 1990 

1 5. Egrie J, Lane J: Use of monoclonal and polyclonal aniipepiide 
antibodies to assay and characterize erythropoietin. Fed Proc 
43:1892, 1984 

16. Uw ML, Cai G-Y, Lin F-K, Wei Q. Huang S-Z, Hartz JH. 
Morse H, Lin C-W. Jones C, Kao F-T: Chromosomal assignment 
of the human erythropoietin gene and its DNA polymorphism. Proc 
Natl Acad Sci USA 83:6920, 1986 

17. Yanisch-Pcrron C. Vicra J, Messing J: Improved M13 phage 
cloning vectors and host strains: Nucleotide sequences of ihe 
M13mpl8 and pUC19 vectors. Gene 33:103, 1985 

18. Kunkel TA. Robeas JD, Zakour RA: Rapid and efticieni siie- 
specitic muUgcncsis without phcnotypic selection. Methods Enzy- 
mol 154:367. 1987 

19. Messing J: New ml 3 vectors for cloning. Methods Enzymot 
101:20, 1983 

20. Wiglcr M, Pclliccr A, SiWerstein S, Axel R: Biochemical 
transfer of single-copy eucaiyotic genes using total cellular DNA as 
donor Cell 14:725, 1978 

21. Lin F-K, Lin C-H, Lai P-L. Browne JK, Egrie JC. Smalling 
R. fox GM, Chen KK, Castro M, Suggs S: Monkey erythropoietin 
gene: Cloning, expression and comparison with the human erythro- 
poietin gene. Gene 44:201, 1986 

22. Schiffer M, Edmundson AB: Use of helical wheels to repre- 
sent the structures of proteins and to identify segments with helical 
potential. Biophys J 7:121. 1967 

23. Richardson JS, Richardson DC: Principles and pattenvs of 
protein conformation, in Fasman GO (ed): Prediction of Protein 
Stnjcture and the Principles of Protein Conformation. New York, 
NY. Plenum, i989, p I 

24. Salake K Kozulsumi H, Takeuchi M, Asano K: Chemical 
modification of erythropoietin: An increase in in vitro activity by 
guanidination. Biochim Biophys Acta 1038:125. 1990 



EPITOPE MAPPING OF ANTt-EPO MoAbs 



2713 



25. Feldman L. Chung T, Zurtuch D, Donahue KA, Powell JS, 
Syikowski AJ: Identification of a stnjctural role for tyrosine- 1 45 in 
Human erythropoietin. Blood 74:194a. 1989 (abstr) 

26. Wen D. Biossel J-P. Showers M. Ruch EC. Bunn HF: Eryth- 
ropoietin structure-function relationships; Identification of function- 
ally important domains. J Biol Chem 269:22839. 1994 

27. McDonald JD, Lin F-K, Goldwasser E: Cloning, sequencing 
and evolutionary analysis of the mouse erythropoietin gene. Mol 
Cell Biol 6:842, 1986 

28. Shoemaker CB. Mitsock LD: Murine eo'thropoietin gene: 
Cloning expression, and human gene homology. Mol Cell Biol 
6:849. 1986 



29. Cotes PM, Bangham DR: Bio-assay of erythropoietin in mice 
made polycythacmic by exposure to air at reduced pressure. Nature 
191:1065. 1961 

30. Grodberg J. Davis KL, Sylowski AJ: Alanine scanning muta- 
genesis of human erythropoietin identifies four amino acids which 
are critical for biological activity. Eur J Biochem 218:597. 1993 

31. D' Andrea AD. Szklut PJ, Lodish HF. Alderman EM: Inhibi- 
tion of receptor binding and neutrali^lion of bioactivity by anti- 
erythropoietin monoclonal antibodies. Blood 75:874, 1990 

32. Sylowski AJ. Donahue KA: Immunochemical studies of hu- 
man erythropoietin using site-specific anti-peptide antibodies. J Biol 
Chem 262:1161. 1987 



Erythropoietin Structure-Function Relationships: High Degree 
of Sequence Homology Among Mammals 

By Danyi Wen, Jean-Paul R. Boissel. Timothy E. Tracy, Robert H. Gruninger, Linda S. Mulcahy, John Czelusniak, 

Morris Goodman, and H. Franklin Bunn 



To investigate structure-function relationships of erythro- 
poietin (Epo), we have obtained cDNA sequences that en- 
code the mature Epo protein of a variety of mammals. A 
first set of primers, corresponding to conserved nucleotide 
sequences between mouse and human DNAs, allowed us 
to amplify by polymerase chain reaction (PGR) intron 1/ 
exon 2 fragments from genomic DNA of the hamster, cat, 
lion, dog, horse, sheep, dolphin, and pig. Sequencing of 
these fragments permitted the design of a second genera- 
tion of species- specific primers. RNA was prepared from 
anemic kidneys and reverse -transcribed. Using our battery 
of species-specific 5' primers, we were able to success- 
fully PCR-amplify Epo cDNA from Rhesus nr>onkey, rat, 
sheep, dog, cat, and pig. Deduced amino acid sequences of 
mature Epo proteins from these animals, in combination 
with known sequences for human, Cynomolgus monkey, 
and rrK)use, showed a high degree of homology, which 

ERYTHROPOIETIN (Epo) is the hematopoietic cyto- 
kine that regulates red blood cell productioa. In mam- 
mals, this 34-Kd glycoprotein hormone is produced in the 
fetal liver and adult kidney, circulates in the bloodsueam, 
and binds to receptors on committed progenitor cells in the 
bone marrow and other hematopoietic tissues, resulting in 
proliferation and terminal maturation of erythroid cells.' 
The expression of both Epo mRNA and Epo protein is 
markedly increased by hypoxia, owing to a 3' enhancer and 
to highly conserved elements in the promoter region.^*^ This 
elegant servomechanism enables Epo to regulate the red 
blood cell mass of humans and other animals. 

A large number of early physiologic studies have estab- 
lished extensive, although incomplete, biologic cross-reactiv- 
ity between the Epos of humans and a number of other 
mammals, including the mouse, rat, sheep, and rabbit.**^ 
In contrast, nonmammalian vertebrates (amphibians,^*'^ 
birds,"**^ and fish*') have erythropoietic hormones that fail 
to cross-react with mammalian erythroid cells, and vice- 
versa.'* * 

Thus far, the Epo genes of a human,** *^ a monkey (Ma- 
caca fascicularis),** and a rodent (the mouse"**') have been 
cloned, sequenced, and expressed. In view of the marked 
cross-reactivity between mammalian Epos, it is not surpris- 
ing that there is a high degree of sequence homology in the 
coding region of the mature secreted proteins. In keeping 
with their close phylogenetic relationship, human and mon- 
key Epos are 94% and 91% identical in nucleotide and 
amino acid sequence, respectively. In contrast, human and 
mouse Epos are 76% identical in nucleotide sequence and 
80% identical in amino add sequence. 

EMrect information on the three-dimensional structure of 
Epo is not yet available. Insights into structure-function re- 
lationships of Epo can be gained from the analyses of a more 
complete set of animal sequences. Such information could 
be useful for sequence-based computer modeling of three- 
dimensional structure. Moreover, a larger data base would 
permit the identification of highly conserved domains that 
are likely to be cnidal to folding and/or biologic function. 



explains the biologic and immunological cross- reactivity 
that has been observed in a number of species. Human Epo 
is 91% identical to monkey Epo, 85% to cat and dog Epo, 
and 80% to 82% to pig, sheep, mouse, and rat Epos. There 
was full conservation of (1) the disulfide bridge linking the 
NHj and COOH termini; (2) N-glycosylation sites; and (3) 
predicted amphipathic a-helices. In contrast, the short di- 
sulfide bridge (C29/C33 in humans) is not invariant. Cys33 
was replaced by a Pro in rodents. Most of the amino acid 
replacements were conservath/e. The C-terminal part of 
the loop between the C and D helices showed the most 
variation, with several amino acid substitutions, deletions, 
and/or insertions. Calculations of maximum parsimony for 
intron 1 /exon 2 sequences as well as coding sequences 
enabled the construction of cladograms that are in good 
agreement with known phylogenetic relationships. 
© 1993 by The American Society of Hematology, 

Finally, comparative amino acid and nucleoiide sequence 
information on Epo provides additional data for investigat- 
ing phylogenetic relationships. 

We report here the use of polymerase chain reaction 
(PCR)-based techniques for obtaining the full coding se- 
quences of Epos of the Rhesus monkey, rat, sheep, pig, dog, 
and cat, as well as partial sequences from three other spe- 
cies. The design and preparation of mutants that test a 4 a 
helical bundle model of Epo iertiar>' structure have been 
presented elsewhere.** 

MATERIALS AND METHODS 
Animal Samples 

Human hepatoma Hep3B cell line and multiple kidney-derived 
ceU lines from hamster (BHK), sheep (MDOR). pig (LLC-PKl), 
dog (MDCK), cat (CRF-K), lion (PALl-K). and spotted dolphin 
(SP 1 -IC) were obtained through the American Type Culture Collec- 
tion (Rockvillc, MD). Monolayer cells were grown in 100 x 20 mm 
tissue culture dishes using recommended media and maintained in 
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Localization 
(bp) 

human (^) mouse ^^^ 



IV1 5' TGAAGTTTGGCCGAGAAGTGGATGC 3' 

EX2R 5' AAGA(T/G)GTACCTCTCCAG(A/G)ACTCG 

EX5 5' CTGCTCCACTCCGAACA(C/A)TCAC 3' 

NCOl 5* CTGGAGTGTCCATGGGACAG 3' 



3' 



984-1009 
1296-1318 
2651-2672 
2885-2904 



994-1018 
1302-1324 
3183-3204 
3415-3434 



ATG 5' AGGCGCGGAGATGGGGGTGC 3' 



615-634 



633-652 



Fig 1 . Oligonucleotide primer sequences. The localization of the prinner coincides to previously published nucleolidic sequences for the 
human'* (1 ) and murine" (2) Epo genes. EX2R and EX5 are not completely conserved between the human and mouse (respectively. 93% 
and 95% identity). An equal amount of each possible nucleotide was incorporated during the corresponding cycles of those primer synthe- 
ses. EX2R and NCOl are reverse primers and their sequences represent the antisense DNA strand. 



a humiditial ^'l C0;/y5'- air iiuubaior at M^'C. in some cxpcri- 
mVnis. cells wore made hypoxic hv overnight incubation in 1'^ O.. 
5'-; CO,, and 9-4':;. N, at -^7X. 

Poly A* RNA prepared iVom Rhesus monkey kidney was pur- 
chased iVom Clontcch (Pato Alio, CA). Kidneys from anemic cat 
and horse were obtained from veterinarians at Tufts School of Vct- 
erinar> Medicine (Grafton. MA) and the University of Nebraska 
(Lincoln. .Nil), respectively. Anemia was induced by three con.secu- 
tive daily intraperitoneal injections of phcnylhydrazine (60 ni&,'kg 
bixiy weight) into male Sprague-Dawley rats (SD-siratn) and by 
repealed bleeding of the dog. the sheep, and Ihe pig. Kidneys were 
aseptically removed after induction and stored immediately in liq- 
uid nitrogen. 



A 4-kb genomic human Hpo clone (tilipo-l) was provided by Ge- 
netics Instiiuie (B<iston, M A). 

DNA and RNA Preparations 

DNAs from cell lines or homogeni/.ed kidneys wore extracted 
using pancreatic RNAase/SDS/ proteinase K following a prtKcdurc 
modified from Rlin and Staiford.^' 

Kidneys vvere homogenized in 4 niol/1, guanidine iMUhiiKV anaio 
(It) niL/g of frozen organ) containing D. 1 mol/I. .^mercapioethanol 
and 2^;^^ N-laur>l-sarcosine. Toial RNAs were i.^ilated by ccntrifuga- 
tion over ^.1 mol/LCsCl.-' .After eihanol precipitation, the s;implcs 
were resuspended in diethyl pyrocarbonaie-ireaied water and 



genomic DNA 



5 ' 



• xon 1 



IV1 



•xon2 eKon3 exon4 



e X o n 5 



^^Y/ IV2 IV3 V//. 



IV4 



I V/A 



c-DNA 




Fig 2. PGR strategy used for the cloning of mammalian cONAs conUining the complete coding sequence of the mature Epo protein. 
Genomic amplification and sequencing of exonic fragments, localized upstream and downstream from the nucleotide portion coding for the 
mature protein, allowed the design of species specific primers (SP). Use of those SP primers and/or of primers. 1 00% conserved between 
the human and mouse (5 ATG and 3 NCOl) on cONA templates prepared from kidney of uninduced or hypoxia-induced animals, allows the 
amplification of a variety of mammalian Epo clones. Sense (-) and antisense (— ) primers are represented by the arrows. Dashed boxes 
correspond to the coding part {propeptide and mature protein) of the five Epo exons. Shaded boxes represent the 5 and 3 untranslated 
regions. 



MAMMAUAN ERYTHROPOIETIN SEQUENCES 



1509 



1010 1020 1030 104O J050 1060 1070 

IVl . • . . . . , 

human *- T6GTAGCT GGGGGTGGGGTGTG CAC ACGG C AGC AGGATTGAAT6AAGGCCAGGG AGGCAGC AC 

pig CTCGTAGTT GTT GGGTGGCGTGTG CAC GCGG TCAGTGTATT6AAT6AAGGCAAAGGAGGCAGAGC 

aotphin GTAGCTC GGGGGGTGGGGTGT6 CAC GCGG CCGCGGGATTGAATGAAGGCAAAGGAGGCAGAGC 

sheep CCGTAGCT GTTGGG6TGGGGT6CG CAC GAAG CCGCGGGATTGAATGAACGC GGAGGCAGAGC 

horse TG6GGGTGGGGTGTG CA6 GCGGCAG CAGGATTGAATGAAGGCAGAGG AGACAGACC 

oog CAGGTCAGTTTGATTGGGTG6 ATGTGTGCCGCGAAGCAGCGG CCGGATTGAATGAAGGC AGGGGA66CAGAAC 

cat TGGTAGTTT GGC6GGTGGTGTGT6CTCACCGCGGCGGCGGCGGGGATTGAATGAAGGCCAAGGAGGCAGAAC 

I ion GGTAGTTT GGC6GGTGG66T6TGCTCACCGCGGCGGCGGCGGGGATTGAATGAAGGCCAAGGA6GCAGAAC 

mouse C6GTCGCTT GGGGG6TGGGGTGTG CAGCGCGG A66GATTGAATGAA6GCCACTC AGGCAGAGC 

hamster ATGGTGGCTT 6GGGGGT6GTGTGTG CAGCGCCG GGAGATAGAATGAAGGCCACTC AGGCAGAGC 

consensus 6GGTGG--TG-G--C 6 AT-GAATGAA-GC AG-CAG--C 



human CTGAGTGCTTGC ATGGTTGGGGACAGGAAGGACGAGCTGGGG CAGAGACG T6GGATGAAGG AAGCTGTCCTTCC AC 

pig CTGCGCGCTCGCAAGGTTGG66TCGGAAACGGCTGGCTGGGGGCAA 6ACGGCCGGATGGGTGAACCTGTCCGTCTAA 

dolphin CTGAGCGCTCGCAAGGTTC66GTCGGGAGGAC TCGC GGG6CCAGAGCAG CGGGATGGGTGAACGTGCaGGTCCAA 

sheep TGAGCGCTCCCAAGGTCCGGGTGGGGAGGGC TCACTGGGGGCTGCGCAG CGGGATGGGTGAACCTGCCGGTCCAA 

horse CTGAGCGCTGGGAAGGTTGGGGGCAGGAGCCACTAGCTGGGGGCAGAGGAG GGGGATGCGTGAACCTGCCCCTCCAA 

dog CTGAACGCTGGGAAGGTGGGGGTCGGGCGCGACTAGTTGGGGGCAGAGGAG CGGGATGTGTGAACCTGCCCCTCCAA 

cat CCGAGCGCTGGGAAGGTTCGGGGTGGGAGCGACAAGCTGGGGGCAGAGGAA CGGGATGTGTGAACCTGCTCCTCCAA 

1 i on CCGAGCGCTGGGAAGGTTCGGGGTGGGAGCGACAAGCTGGGGGCAGAGGAA CGGGATGTGTGAACCTGCTGCTCCAA 

mouse CTAAGCAATTGCAAGGTCGGGGTCAGCAGAGACTAT AAGGGCAGAGGGGTCTCGCTGAGCCAACCGGCCCCTGCAG 

hamster CTAAGCAGTTGCAAGGTCGGGGTCAGCAGAGACTGG AAGGGCAGAGGAGCCTCGCTGTGCAAACCGGTCTCTGGTC 

consensus --T---A-GGT--66G---6 6G-C---G-- G-TG AA---G T 



1150 



1160 



1170 



1180 



1190 



human AGGCCACCCTTCTCCCTCCC CGCCTGACTCTCAGCCT6GCTATCTGTTCTAG 

pig A CCAACCCTTCTCCTGCAATGCCCAGCCTCACATTCAGCCTGGCTCTCTTTCCTAG 

do Iph i n A CCACCCCTTCTCCTGCAAGTCCC66CCTCACACTCAGCCCAGCTCTCTCTCCTAG 

sheep A GCAGCCCTTCTCCTGCGA6AATGCACCTCATCCGCAGCCCGTCTCTCTTTCCTAG 

horse A CCAACCCTCCCCTTCCCCTCCCT GCCTCACACTCAGCCCG6ATCTCCTTTCCAG 

dog A CCCA CACAGTCA6CCTGGCACTCTTTTCCAG 

cat A CACACTCAGCCTG6CACTCTTTTCTAG 

lion A CACACTCA6CCTGGCACTCTTTTCTAG 

mouse TCCCAC6GCCCCC TCCCCTCTC GGCCTCACACTCAGCCT6CCTTTC TTCCAG 

hamster GTCCCACGGCCC TCCCCTCCC AGCCTCACACCCAGCCTGCCTTTC TTCTA6 

consensus A CAGCC-G TC--T-C-AG 



1210 

AATGTCCTGCCTGGCTG 
AATGTCCTGCCCGGCT6 
AATGTCTTCTGCTGCTT 
ACTGTACTCCGCT6CTG 
AATGTCCTGCCCTGCTG 
AATGTCCTGCCCTGCTC 
AATGTCCTGCCCTGCTG 
AATGTCCTGCCCTGCTG 
AACGTCCCACCCTGCTG 
AACGTCCAACCCT6CTG 
A--GT GCT- 

£XQN 2 



1230 



\2A6 



1250 



1260 



1270 



1280 



1290 



human TGGCTTCTCCTGTCCCTGCTGTCGCTCCCTCTGGGCCTCCCAGTCCTGGGC6CCCCACCACGCCTCATCTGTGAC AGC-< 

prg CTTCTGCTATCTTTGCTGCTGCTTCCTCT6GGCCTCCCA6TCCTGGGCGCCCCCCCACGCCTCATCTGTGACAGC 

dolphin CTGCTGCCCTCCTT6CTGCTGCTTCCTCTGGGCCTCCCAGTCCTGGGCGCCCCC ACACACCTCATCTGTGACAGC 

sheep CTGCTGCT6CT6TCTCTTCT6CTGTTTCCTCTGG6CCTCCCAGTCTTG66C6CCCCCCCACGCCTCATCTGTGACAGC 

horse CTTCTGCT6TCCCT6CTACT6CCTCCTCTGG6CCTCCCA6CCCTGGGC6CCCCTCCACGCCTCATCTGTGACACC 

dog CTTTTGCTGTCTTTGCTGCTGCTTCCTCT66GCCTCCCA6TCCT66GC6CCCCCCCTCGCCTCATTTGTGACAGC 

cat CTTCTGCTATCTTTGCTGCTGCTTCCCCTGGGCCTCCCA6TCCTGG6CGCCCCCCCTCGCCTCATCTGTGACAGC 

t ion CTTCTGCTATCTTT6CTGCTGCTTCCCCTGGGCCTCCCAGTCCT6GGCGCCCCCCCTCGCCTCATCTGT6ACAGC 

mouse CTTTTACTCTCCTTGCTACT6ATTCCTCTGGGCCTCCCAGTCCTCTGT6CTCCCCCAC6CCTCATCTGCGACAGT 

hamster CTTTT6CT6TCTTTGCTCCTGCTTCCCCT6G6CCTCCCA6TCCTCT6CGCTCCCCCAC6CCTCATCT6C6ACAGC 

consensus ---CT--T-C--TC--T-CT-—G---CC-CT66GCCTCCCAG-C-T--G-GC-CC--C-C-CCTCAT-TG-GACAG- 



EX2R 



Fig 3. Comparison of I VI /EX2R sequences from various mammals. The numberir>g corresponds to the published human genomic se- 
quence.'* The reported human sequence was obtained from the amplification of purified genomic DNA from Hep3B cells and agreed with 
the sequence prevtously reported. The mouse sequence is from McDonald et al.'' The horse sequence was obtained from kidney-extracted 
genomic DNA. All the other sequences were established from PCR amplification of genomic DNAs purified-ffrom several mamnialian 
renal-derived cell lines. The consensus sequence indicates the positions In which a unique nucleotide was found in all of the reported 
species. The boundary between intron 1 and exon 2 is represented by the ascer>ding arrows. The locations of the two PCR primers. IV1 and 
EX2R, are shown by the dashed arrows. 



stored at -70**C. Confluent monolayer cells were washed twice 
with sterile phosphate-buffered saline and directly lysed in the 4 
molA- guanidinc isothiocyanalc solution. Total RNAs were iso- 
lated as described above for the kidneys. 

RN A samples were first converted into single-strand cDNA. Two 
to 4 Mg of total RNA from kidney or 500 ng to I #ig of total RMA 
from cultured cells was denatured at 68*C in presence of 2 Mg of 
oligo dT(,}). Reverse transcription was peiformed in a 20 final 
volume, containing 50 mmol/L Tris-HQ, pH 8 J, 6 mmol/L 
MgOj. 40 mmolA* KO, 10 mmol/L dithiothrcitol (DTT)t 500 



Mmol/L dNTPs, 20 U RNAsin (Promega, Madison, WI). and 20 U 
of avian myeloblastosis virus reverse transcriptase (AM V-SuperRT; 
Molecular Genetic Resources, Tampa, FL). The reaction was al- 
lowed to proceed at 37"C for 20 minutes and then at 42'C for 1 
hour. Inactivation of the enzyme was performed at IOO*C for 10 
minutes. cDNA samples were stored at -70*C. 

PCR 

Twenty to 100 ng of genomic DNA or 0.5 to I mL of the RT-reac- 
tion were used as templates for amplification. 
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, 50 

AGOCGCCGAG ATGGCGGTGC ACGaaIGTCC TGCCIOGCTG TGGCTTCTCC TGTCCCTGCT GTCGCTCCd CTGGGCCTCC CMiTCClGGG 

ACGAATGTCC TCCCTGGCTG TGGCIKKC TGTCTCKGT GTCGC7CCCT CTGGGCCTCC CAG?CCCGGG 

CCCAACCTCC CACC---CTC CTGCTTTTAC TCTCCTTGCT aCTGATTCCT CTGGGCCTCC CaQTCCTCIC 

gSr^ CCGAACGTCC CACC---CTC CTCCTTITAC TaTCCTTGCT aCTCATTCCT CTGGGCCTCC CaGTCCTCTG 

Irhi, GCGACTGTAC TCCCCTCCTG CTGCTGCTCC TGTCTCTTCT GCTGTTTCCT CTGGGCCTCC CAGTCTTGGC 

GAATGTCC TGCCCGG--- CTGCTTCTCC TaTCTTTGCT CCTCCTTCd CTGGGCCTCC CAGTCCTCGC 

TUT GAATGTCC TGCC---CTG CTGCTTCTGC TaTCITTGCT GCTGCT7CCC CTGGCCCTCC CAGTCCTGGC 

GAATGTCC TGCC---CTG CTCCTTTTGC TGTCTTT&CT GCTGCTTCCT CTGGGCCTCC CAGTCCTGGG 



100 

CGCCCCACCA CGCCTCATCT GTGACAGCCG 

CGCCCCACCA CGCCTCGTCT GTGACAGCCG 

TGCTCCCCCA CGCCTCATCT GCGACAGTCC 

CGCTCCCCCA CGCCTCATTT GCGACAGTCC 

CGCCCCCCCA CGCCTCATCT GTGACAGCCG 

CGCCCCCCCA CGCCTCATCT GTGACAGCCG 

CGCCCCCCCT CGCCTCATCT GTGACAGCCG 

CGCCCCCCCT CGCCTCATTT GTGACAGCCG 



150 

AGTCCTGGAG aGGTaCCTCT TGGAGGCCAA 

agtcctggag aggtacctct tggaggccaa 
agttctggag aggtacatct tagaggccaa 

CGTTCTGGAG AGGTACATCT TGGAGGCCAA 
AGTCCTGGAG AGGTaCATCC TGGAGGCCAG 
AGTCCTGGAG AGGTACATCT TGGAGGCCAA 
AGTCCTGGAG AGGTACATTC TGGAGGCCaG 
GGTCCTGGAG AGATACATCC TGGAG 



ggagcccgac aatatcacga cogoctgtgc 

cgaggccgag aatgtcacca tggcctgttc 

ggaggcagaa aatgtcacca tgogttctgc 

ggaggcagaa aatgtcacaa tgcgctgtgc 

ggaggccgaa aatgccacca tgggctgtgc 

ggagggcgaa aatgccacca tgggctgtgc 

ggaggccgaa aatgtcacca tgggctgtgc 



200 

tg*^cactgc agcttgaatg AGAATATCAC 
CGAAAGCTGC AGCTTCAATG AGAATATCAC 
AGAAGGKCC AGACTGAGTG AAAATATTaC 
AGAAGGTCCC AGACTGAGTG AGAATaTTAC 
AGAAGGCTGC AGCTTCAGTG AGAATATCAC 
CGAAAGCTGC AGCTTCAGTG AGAATATCAC 
TGAAGGCTGC AGCTTCAGTG AGAATATCAC 



TGTCCCAGAC ACCAAAGTTA ATTTCTaTGC 
CCTCCCAGAC ACCAAAGTTA aCIICIaIGC 

agtcccagat accaaagtc* acttctatgc 

CCTCCCAGAT ACCAAAGTCA ACTTCTaCGC 
TGTCCCAGAC ACCAAGGTTA ACTTCTATGC 
TGTCCCAGAC ACCAAGGTTA ACTTCTATGC 
TGTCCCAGAC ACCAAGGTCA ACTTCTaTAC 



260 

CT&GAAGAGG ATGGAGGTCG GGCaGCaGGC 
CTGGAAGAGG AlAGAGCTCG GGCaGCaGGC 
TTGGA4AAGA ATGGAGGTGG AJCa*C*GGC 
TTGGAAAAGA ATCAAGGTGG AaGaaCAGGC 
CTGGAAGAGG ATGGAGGTCC ACCaGCaGGC 
CTGCAAGACG ATGGAGGTCC ACCAGCAGGC 
CTGGAAGAGG ATGCACGTCG GGCAGCAGGC 



CGTAGAAGTC 
TCTAGAAGTC 
CATAGAAGTT 
TGTAGAAGTT 
TCTGGAAGTC 
CATGGAGGTC 
TCTGGAAGTC 



CCTGCACCTC 
CCTGCAGCTG 
CCTTCACCTT 
TCTTCACCTT 
CCTGCGGCTG 
CCTGCAGCTG 
CCTGCAGCTG 



TCCCAGGGCC 
TGGCAGGGCC 
TGCCAAGGCC 
TGGCAAGGCC 
TGGCAGGGCC 
TCGCAGGGTC 
TGGCAGGGCC 



CATCTGGATA 
CACATGGaTA 
CATATAGACA 
CATATAGACA 

cacgtcgaca 

CATGTGGACA 
CATCTGGATA 



000 

TGCCCCTGCT 
TGGCCCTGCT 
TGTCCCTGCT 
TGTCTCTCCT 
TGGCTCTGCT 
TGGCCCTGCT 
TCGCCCTGCT 



AACCCGTCAG 
AAGCCATCAG 
AAGCCATCAG 
AAGCCATCAG 
AACCTCTCAG 
AACCTCTCAG 
AACCCGTCAG 



CCCTCCACAT GCCCCC---T CACCTC---C 

CCTCCCAGAT GCCGCC---T C6CCTC---C 

CCCTCCAGAT ACCACC---C CACCTG---C 

GCCTCCAGAC GCCACC---C AAGCCG---C 

CCTTCCAGAT CCAACCCCCT CCGCAG---C 

CCTTCCAGAC GCATCCCCCT CCTCTGCCaC 

CCTTCCAGAG GCAACC---T CTGCTG-"C 



GTCCGAAGCT 
CTCAGAACCT 
CTCACAAGCC 
CTCACAACCC 
CTCACAAGCT 
CTCACAAGCC 
CTCACAAGCC 



1)00 

TGGCCTTCGC 
TGGCCTTCGC 
TCGTCTACGT 
TGGGCTACGT 
CGGCCTCCGC 
CGGCCTGCGC 
CAGCCTGCGC 



TCCACTCCGA 
TCCACTCCCA 
TCCACTCCGA 
TCCACTCCCA 
CCCACTCCCA 
CCCACTCCCA 
TCCACTCCGA 



650 

CCTCCGCGGA 
CCTCCGGCCA 
CC TCCGGGGG 
CCTCCGGGGG 
CCTGAGCGGA 
CCTCCGCGGA 
CCTGCGGGGA 



CTCCCTCACC 
CTCCCTCACC 
CTTCCTCCCC 
CTCCCTCACC 
CTCCCTCACC 
TTcaTCACC 
CTCACTTACC 



AAGCTCAACC 
AAGCTCAAGC 
AAACTGAAGC 
AAACTGAAGC 
AAGCTGACCC 
AAGCTCACGC 
AAGCTGACCC 



TGTACACACG 
TCTACACGGG 
TGTACACGGG 
TCTACACGGG 
TGTACACAGG 
TGTACACTGG 
TGTACACAGG 



ggaggcctgc 
cgacgcctgc 
agacgtctgc 
ggaggcctgc 

GGAGGCCTGC 
AGAGGCCTCC 
GGAGGCCTGC 



ctcctgcccg 
gtcctgccgg 
atcctgcagg 
atcctgcagc 
atctttcggg 
atcctgcagg 
atcctgcgcg 



AGCCTCACCA 
ACCATCACCA 
AGCCTCACTT 
AGCCTCACTT 
AGTCTCACCT 
AGCCTCACCT 
AGCCTCACCT 



600 

ACAATCACTC 
ACCATCACTG 
ACACTCACAG 
ACACTCACAG 
ATATTCACTG 
ACATTTGCTG 
ACATTCACTG 



AGGACAGCCG 
AGCACAGGGC 
AGCAGAGGGG 
AGGAGAGCGG 
AGCAGAGGGG 
ACGAGAAGGG 
CGAAGAGGAC 



GCCACCCCCT 
GCCACGCCGT 
CCCACGCCCT 
CCCAGGCTCT 
CCCACGCCCT 
GCCACGCCCT 
CCCACGCCCT 



CTTGCTCAAC 
GTICGCCAAC 
GCTAGCCAAT 
GCACCCCAAT 
ACCGGCCAAC 
CTTGGCCAAC 
GCTCCCCAAC 



CTCTCCTTCG 
CTCTGCTTCG 
CACTCCTTCG 
CACTGCTTCC 
CCCTCCTTCC 
CCCTCCTTCC 
CCCTCCTTCC 



CTGACACT7T 
CTGACACTTT 
TGGATACTTT 
CCCATACTTT 
TTCATGCTTT 
TTCATACTTT 
TCGATACTTT 



GCCTCTGGGA 
GGCGCTCCGA 
GGTACTGGGA 
GGTGCTGGGA 
GGCCCTGGGA 
GGCCCTGGGA 
GGCACTGGGA 



CCGCAAACTC 
CTGCAAACTC 
CTGCAAGCTC 
CTGCAAGCTC 
6TCCAAGCTC 
GTGCAAACTT 
6T6CAAACTT 



360 

TCTTCCCAGC 
TCTTCCCACC 
TCCTCCCAGC 
TCCTCCCAGC 
GCATCCCAGC 
TCCTCCCAGC 
TCCTCCCAGC 



GCCCAGAAAC 
GCCCAG---C 
CCTCAGAAGG 
CCTCaGAACG 
GCCCAGAAAC 
GCCCAGAACG 
GCCCAGAAGG 



TTCCCAGTCT 
TTCCGAGTCT 
TTCCGGGTCT 
TTCCGGGTCT 
TTCCCAATCT 
TTCCGCAACT 
TTCCCAATCT 



CCTGCCAGCC 
CTTTCCACCC 

caccagacac 
caccagacag 

CATGCGAGGC 

catctgacgc 
catctgacac 



AAGCCATCTC 
AAGCCATCTC 
AATTGATGTC 
AATTGATGTC 
AAGCCATCCC 
AAGCCATCCC 
AAGCCACCTC 



ACTCCAATTT 
ACTCCAATTT 
ACGCCAACTT 
ACTCCAACTT 
ACTCCAATTT 
ACTCCAATTT 
ACTCCAACTT 



GOO 

acacatgacc 
acagatgacc 

ACAGGTGACA 
ACAGCTGACC 
ACAGGTGACC 
ACAGGTCACT 
ACAGGTGACC 



aggtgtgtcc 
acgtgcctcc 

TGC TGCTGCC 
TGC C 

cagtgctatc 
tggtgctccc 
acgtcctcct 



ACC-TGGCCA 
ACC-TCGGCA 
ACC-CTGCTC 
ACT-CCCGTG 
ACCCCCGGCA 
ACCCCCGCCA 
ACCCCCGGCA 



tatccaccac 

CATCCACCAA 
CACCGACCAA 
TaCCCGCCAA 
CCTCCATCAC 
TCTCCACCAC 
TGTCCACCAC 



660 

AACATTGCTT 
AACACTGCCT 
GTCACTGTGT 
GTCACTGTGT 
ATCACTGCCT 
ACCACTGCCT 
ACCAATGCCT 



CTGCCACACC CTCCCCCC-- 
6TGCCACACC CTCCCTC--- 
CaTGCCAACC ctcc 

cacgccaacc ctcc 

-acgccatgc cttccacgcc 

GTCCCATCCC TTCT-GCaCC 
GTCCCACCCC CTCT-CCACC 



GCCACTCCTG AACCCC--GT 
ACCACTCCCC AACCCC--AT 
ACCACTCCC- AACCC-TCAT 
ACCACTCCC- AACCC-TCAT 
CCCACTCCC- AACCCC-T6T 
TCCACTCCC- AACCCCC-CC 
ACCACTCCTC A-CCCC-TCT 



700 

CCA—GCCG- 
CCA--CCGC 
CAAAC-CGG 
CAAACCGGCT 
CCACCACGGG 
TCA---CCCG 
C GCCCC 



--'CTCTCAG CTCACCGCCA 

-••CTCTCAG CTAACCCCCA 

---TCATTAC CTTCTTACCA 

TCTTTOTTAC CTTCTTACC6 

---CCACCAC CTCACT 

-"CCATCAG CTCACCGCCA 

TC---ATCAC CTCAGCACCA 



7«12 

GCCTGTCCCA TCCACACTCC AG 

GC 

GT 

CC 

GC 

a 

CC 



Fig 4. Alignment of the nu- 
cleotide sequences of mamma- 
lian Epo cDNAs. The mouse se- 
quence corresponds to the one 
previously reported." PCR-pro- 
duced human sequence was 
obtained from hypoxia-induced 
Hep3B cell line and is in total 
agreement with the sequence 
previously published.*^*^^ Mon- 
key {Macacs mufatta) (Gen- 
Bank; accession no. LI 0609), 
rat (L10608), sheep {L10610), 
pig (LI 0607), cat (LI 0606), 
and dog (LI 3027) were ampli- 
fied from kkinev-purified cDNAs. 
AmplificatKMns of the human, 
monkey, rat, and sheep were 
realized using the ATG (resi- 
dues 1 to 20) and NCOl (resi- 
dues 723 to 742) primers. The 
line under residues 41 to 66 
represents the specific SP1 
primer used for the cloning of 
the pig, cat, and dog se- 
quences. For these three mam- 
mals, sequences from SP1 are 
derived from the data pre- 
sented in Fig 3. Arrows indicate 
the start and the and of the cod- 
ing sequence (propeptide and 
mature protein). 



Standard PCR reactions were performed in a 100 /it volume 
conuining 50 mmoI/L Tris-HQ, pH 8.3. 50 ramol/L KO, 1.5 
mmol/L Mga,. 0.01% wt/vol gelatin, 200 fimol/L dNTPs. 30 
pmoI/L of each sense (SO and anlisensc (3*) primers, and 0.5 U of 
Taq DNA polymerase (Pcrkin Elmer Cetus, Emeryville, CA). After 
an initial denaturation step for S minutes at 9S*'C, routinely 35 
cycles of PCR were performed on the DNA Thermal Cyder (Pcrkin 
Elmer Ccius). A denaturation step was performed at 94"C for I 
minute. For each sample and/or the primer pair used, annealing 
temperatures were optimized (from 45*C to 60**Q for a 2-minulc 
reaction. A 3-minute elongation step at 72X ended each cyde. 

Amplification products were analyzed on 1% to 3% agarose gel- 



Tris-acetate-EDTA (TAE). Nu Sieve or Seaplaquc agaroses (FMC, 
Rockland, ME) were used for preparative purposes. Specific PCR 
products were recovered from the gel by the use of the Gene Dean 
II kit (Bio 101, La Jolla. CA), by Spin X cenlrifugaiion (Costar, 
Cambridge, MA), or by standard phenol/chloroform extractions of 
melted gel slices. 

Subcloning and Sequencing 

Small (250 to 330 bp) amplified genomic firagmcntt were directly 
subjected to automated sequencing, using the PCR primers as DNA 
sequendng primers. 



MAMMALIAN ERYTHROPOIETIN SEQUENCES 
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Human MGVHECPAWLWLLLSLLSLPLGLPVLGfAPP 

Monkey MGVHECPAVLWLLLSLVSLPLGLPfVPGAPP 

Rodents MGVPERPT-LLLLLSLLL IPLGLPVLCAPP 

Sheep MGVRDCTPLLLLLLSLLLFPLGLPVLGAPP 

Pig ECPAR-LLLLSLLLLPLGLPVLGAPP 

Horse ECPA-LLLLLSLLLPPLGLPALGAPP 

Cornivores ECPA-LLLLLSLLLLPLGLPVLGAPP 

Dolphin EC-LLLLLLPSLLLLPLGLPVLGAPP 



Fig 5. Predicted amino acid sequences of Epo propeptide. The amino acid sequences are derived from data obtained from both genomic 
IV1 /EX2R and cON A' amplifications. The numbering correspoiKls to the human sequence. Ala 1 is the first amino acid of the human mature 
protein. The ascending arrows show the site of the cleavage by the signal peptidase, as determined for the human and cynomolgus monkey 
proteins. 



PCR-generated cDNA products were blunt-ended using KJenow 
fragment and 5' phosphorylatcd by T4 DNA kinase (Boehringer 
Mannheim, Indianapolis, IN), They were then cloned into the Sma 
\ site of the phagcmid pBluescript II SK (Straiagenc, La JoUa, CA). 
The doubie-stranded plasmid was sequenced using the SK and ICS 
sequencing primers. 

In all cases, sequencing reactions were performed on an Applied 
Biosystems 373A Automated DNA Sequencer using the DyeDeoxy 
Terminator Sequencer kit (Applied Biosystems, Foster City, CA) 
and thermal cycling with Taq DNA polymerase (Promega), as previ- 
ously described." To avoid errors, for each species, samples derived 
from 4 10 8 different amplifications were isolated and subcloned. In 
each case, both strands were sequenced. A very low error rate was 
encountered, in keeping with the 100% agreement between the rat 
cDNA sequence (see Fig 4) and that recently reported by Nagao et 
al" 

Mammalian Expression 

Inserts containing full-length mammalian Epo coding sequences 
were subcloned into pSG5 plasmid (Slratagene) and were tran- 
siently expressed in Cos7 cells. Seventy-two-hour supematants 
were tested for their ability to sustain cellular proliferation of the 
Epo-dependenl HCD57 cell line." 

RESULTS AND DISCUSSION 

Because of the known biologic cross-reactivity between 
various mammalian Epos and the presumption of strong 
homology in the coding sequences, a logical cloning strategy 
would be to screen cDNA libraries with moderate to low 
stringency. However, Epo mRNA is expressed at barely de- 
tectable levels in all tissues except in hypoxic kidney and 
li vcr.^**" Therefore, it would be necessary to generate librar- 
ies for each of the species of interest. Our primary research 
interest is in structure-function relationships of Epo and, 
therefore, we require full coding sequences of the mature 
Epo protein, rather than full cDNA sequences. Accordingly, 
we elected to use a PGR cloning strategy predicated on 
known strong homologies in Epo cDNA sequences that 
flank the region encoding the mature protein. There was 
sufficient conservation of sequence around the NCOl site 
in the 3' untranslated rc^on (UTR) that a single primer 
could be used to amplify full-length mature Epo 3' coding 
sequences in the rat, sheep, pig, and cat as well as in the dog. 
However, we were unsuccessful in finding a universal 5' 
primer and therefore had to generate spedcs-specific se- 
quences from amplifications of genomic DNA. This two- 



stage PCR strategy proved to be satis fac to r>' in generating 
highly accurate sequence information with a minimum ex- 
penditure of time and resources. In all instances, there was 
full agreement between sequences of complementar\' 
strands. 

Preparation of Mammalian Epo cDNAs 

Primer design for PCR amplification of Epo. The se- 
quences of Epo genes from three species have been already 
published: two primates, the human and Cynomolgus 
monkey {Macaca fasciadaris),^^ and a rodent, the mouse 
{Mus muscidus)?^-^^ There is substantial homology at the 
nucleotide level not only in the coding sequences, but also 
in portions of introns and in 5' and 3' untranslated se- 
quences of exons I and 5. respectively. A number of 
primers, corresponding to conserved nucleotide sequences 
between the mouse and human, were synthesized and tested 
for their ability to produce genomic PCR fragments from a 
wide variety of mammalian cell lines. A 4-kb human geno- 
mic clone and genomic DNA extracted from the human 
hepatoma cell line Hep3B were used as control templates 
for the PCR amplifications. 

In particular, two pairs of primers, IV1/EX2R and EX5/ 
NCOl (Fig 1), directed the amplifications of fragments of 
330 and 250 bp, respectively, from genomic DNA of all 
human and mammalian cell lines that we investigated. IV 1 
is a 25-mer oligonucleotide localized in inlron I, 216/217 
bases upstream from the start of exon 2. The human and 
mouse are identical except that the human sequence con- 
tains an extra G nucleotide at position 996. IV 1 primer 
corresponding to the mouse sequence was synthesized. The 
23-mer EX2R ends 28 nucleotides upstream from the 3' end 
of exon 2. EX5 is a 22-mer, beginning 34 bases downstream 
from the 5' end of exon 5. NCOl represents a 20-bp. 100% 
conserved, DNA fragment, starting 112/113 bases down- 
stream from the TGA stop codon and contains the unique 
NCOl site present in the human and mouse gene. 

An ATG primer was also synthesized, corresponding to a 
20 oligonucleotide stretch, extending both 5' and 3' from the 
initiator methionine codon. This sequence is also totally 
conserved between primates and mouse. Whereas combina- 
tions of cxonic primers (ATG/EX2R. EX5/NCOI. and 
ATG/NCOI) were able to direct amplification from cDNA 
prepared from RNA of the human Epo-producing cell line 
Hep3B, no amplification was obtained on cDNA prepared 
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Fig 6, (A) Alignment of the primary structures of mature mammalian Epo proteins. The human. Macaca fascicularis. and mouse amino 
acid sequences were previously reported. The residue numbers correspond to the 1 66 amino acids of the mature human protein. Plain 
boxes indicate the positions of the predicted four a helices in the human sequence.^* Dashed boxes show the N- and 0-glycosylation sites. 
Boundaries between exons are indicated by the vertical lines. (B) Schematic representation of mammalian Epo. The invariant amino acids 
among the eight sequences are represented by the solid boxes. The localization of each of the four a helices is underiined. The three 
glycosylatlon sites are shown as diamonds. The main disulfide bridge is indicated. The small arrows under the sequence indicate the short 
SH-bridge, which is missing In the rodents. 



from uninduced or hypoxia-induced BHK, MDOK. LLC- 
PKI. MDCK, CRF-K, PALI-K. and SPl-K cells (results 
not shown). Therefore, these renal cell lines apparently do 
not express Epo mRNA, either constilutively or after hyp- 
oxic stress, at a level detectable even by RT-PCR. Further- 
more, no Epo protein was detectable by radioimmunoassay 
in the supernatants of cells maintained overnight in 1% Oj, 
5% COj. and 94% Nj at 37*C 

The amplification of species-specific IV1/EX2R and 
EX5/NC0I genomic fragmenU allowed us to design a strat- 
egy shown in Fig 2 for obtaining cDNA clones containing 
the complete coding sequence of mature Epo protein from 
various mammals. The 1V1/EX2R primer pair resulted in a 
genomic fragment ('^330 bp) extending from the 3' third of 
the first intron through 80% of cxon 2, encoding the COOH- 
cnd of the propeptide and the NHj-terminal amino acids of 
the mature protein. EX5/NCOI primers amplified a seg- 
ment of cxon 5 containing the COOH-end of the Epo pro- 
tein and a 3' untranslated sequence of about 1 14 bp down- 



stream of the stop codon. DN A sequencing of these 5' and J 
fragments permitted, if necessary, the design of a second 
generation of species-specific primers (SP) localized outside 
the coding part of the mature protein (ie. in the sequences 
encoding the propeptide and the J untranslated region). 

Amplification of partial intron l/exon 2 genomic 
clones. We first explored the efficacy of the IV 1 and EX2R 
primers for amplification of genomic fragments from differ- 
ent purified DNA templates. In all the reactions, fragments 
of predicted size were the main (and, when using the most 
stringent annealing temperature, often the only) products 
delected by agarose gel electrophoresis. The (direct) se- 
quences of these PCR-generated fragments are presented in 
Fig 3. The nucleotide alignment includes 6 different orders 
of mammals: one primate, the human (Homo sapiens); two 
artiodactyls, the sheep {Ovis aries) and pig {Sus scrofa)\ one 
perissodactyl, the horse (Equus cabaUus)\ one cetacean, the 
spotted dolphin (Stenella plagiodon)\ three carnivores, the 
dog(Canis familiaris), cat {Felis catus), and lion {Panthera 
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Table 1 . Degree of Conservation Arr^ong Mammals 
of the Mature Epo Proteins 



% Identity 



Primates 




Human/Rhesus monkey 


90.5 


Rhesus/Cynomolgus 


98.8 


Rodents 




Human/ mouse 


79.8 


Human/ rat 


82.1 


Mouse/rat 


93.5 


Artiodactyls 




Human/sheep ■ 


81.0 


Human/pig 


81.6 


Sheep/pig 


887 


Carnivores 




Human/cat 


84.5 


Hunnan/dog 


84.5 


Cat/dog 


95.2 



The percentages of identity among the various sequences were deter- 
mined from the amino acid alignment reponed in Fig 6A. 



leo)\ and two rodents, the mouse {Mus muscuhts) and ham- 
ster {Cricetus cricetus). 

The cat and lion exhibited an almost complete sequence 
identity, with only one T to G nucleotide substitution near 
the 5' end of the amplified portion of the first intron. 

The ability of the intronic IV I to anneal to genomic DNA 
from various species and, in combination with EX2R, to 
amplify PGR fragments of identical size, showed a high de- 
gree of conservation of sequence and position between 
mammals. This finding suggests that IV 1 sequence may be 
involved to some extent in the regulation of the Epo gene 
expression. The comparison of IV1/EX2R sequences also 
showed two remarkably conserved intronic sequences: 
AT(T/A)GAATGAA(G/C)GC (nucleotides 1046 to 1058 
in the human gene) and A(A/T)GGTN(G/C)GGG (nucleo- 
tides 1085 to 1094). 

Amplification of partial exon 5 fragments firom genomic 
DMAs. PGR reactions were also performed applying the 
EX5/NCOI primer pair to various purified genomic DNAs, 
As we previously observed for the amplification of IVl/ 
EX2R, in all of the tested samples a unique main band of 
about 250 bp (as predicted for human and mouse genes) was 
detected on analytical agarose gel electrophoresis. Direct 
DNA sequencing of these fragments showed that the bulk of 
the purified PGR product corresponded to the expected Epo 
exon 5 sequence. However, for some DNA samples, which 
were purified from horse kidney and several cell lines (BHK 
and LLC-PKI in particular), unknown sequences were pres- 
ent to various extents. These minor contaminating PGR 
products generated extraneous peaks in the sequence data, 
resulting in ambiguities at several positions. Nevertheless, 
computer-edited analyses of generated 3' noncoding se- 
quences were sufficient to design, if necessary, specific 
primers with greater than 95% acxniracy (data not shown). 

Cloning of partial cDNAs encoding the complete mature 
Epo protein. As we previously mentioned, wc were unable 
to obtain any amplification with cDNAs prepared from 
rcnal-dcrivcd mammalian cell tines. Therefore, wc tested 



our battery of primers on reverse-Iran scribed RNAs pre- 
pared from kidney of several mammals. We have success- 
fully amplified Epo cDNA from one primate, the Rhesus 
monkey (Macaca mullata)\ two carnivores, the dog {Canis 
familiaris) and cat {Fetis catus)\ one rodent, the rai [Rattus 
norvegicus); and two 'artiodactyls, the pig {Sus scrofa) and 
sheep (Ovis aries). The aligned nucleotide sequences are 
presented in Fig 4, They have been deposited with the Gen- 
Bank Data Library and were given the following accession 
numbers: LI 0609 (Rhesus monkey), LI 0606 (cat), LI 0608 
(rat), L10607 (pig), L10610 (sheep), and L13027 (dog). 

The Rhesus monkey, rat, and sheep cDNAs were ampli- 
fied using the ATG/NCOI primer-pair combination. The 
pig, cat, and dog sequences were obtained by the use of a 5' 
24-mer specific primer (SPl, 5' TGCTTCTGCTATCTT- 
TGCTTGCTGC 3^). 1V1/EX2R sequencing showed that this 
exon 2 sequence was shared by the three Sf>ecies. In all PGR 
amplifications, NGOI was used as the reverse primer. Con- 
servation of the NCOl sequence among mammals suggests 
a potential biologic role of this nucleotide sequence (per- 
haps in modulating the stability of Epo mRN.A).^'* Consider- 
able homology is also found in the sequenced 3' untrans- 
lated segment of exon 5 (Fig 4). 

When compared with human Epo, the overall i)ercent- 
ages of nucleotide identity are, respectively, 93.5% for the 
Rhesus monkey. 86.7% for the cat, 85.7% for the dog and 
the pig, 83% for the sheep, and 75.5% for the rat. The Rhe- 
sus monkey and Cynomologus monkey'* were 99.6% iden- 
tical. The mouse*'-" and rat nucleotide sequences showed 
greater than 93% identity. The cat and dog sequences were 
91% identical and the two sequenced aniodactyls, the pig 
and sheep, showed 88% identity.* 

Comparison of Mammalian Epo Primary Sequences 

Propeptides. The predicted amino acid propeptide se- 
quences of several mammals are presented in Fig 5. In the 
primates, there is strong conservation of the sequences of 
deduced propeptides, with only two amino acid substitu- 
tions at positions - 1 1 (Leu in humans v Val in both mon- 
keys) and -2 (Leu in humans v Pro in both monkeys). Ex- 
pression of Gynomolgus monkey Epo gene in cultured 
mammalian cells resulted in the production of mature 
monkey Epo that was elongated at the N-terminus by three 
additional residues, Val-Pro-Gly." As the Rhesus monkey 
has the same substitutions, an identical site of cleavage is 
likely. The Leu to Pro amino acid replacement at -2 is 
probably responsible for the differential activity of the signal 
peptidase observed between humans and monkeys. The 
three rodents (mouse, rat, and hamster) have identical pro- 
peptide amino acid sequences. The two carnivores also 
share one propeptide sequence. 

Mature protein. A comparison of the three previously 
published and our six deduced amino acid sequences of 
mature Epo proteins is shown in Fig 6A. Epo is highly con- 



• Several PGR attempts on total RNa purified from the horse 
kidney were unsuccessful, even when using 5- and ^-specific equine 
primers (SP), presumably because analysis on \% formaldehyde/ 
agarose gd showed degradation of the RNA. 
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served among mammals. The amino acid alignment 
showed that more than 63% of the molecule is composed of 
invariant amino acids (106 residues) (Fig 6B). Most of the 
observed substitutions are conservative, involving residues 
with similar physical and chemical properties. The calcu- 
lated percentages of sequence identity are shown in Table 1 . 

There is 100% conservation of Cys7 at the N-lerminal 
portion of the polypeptide and Cys 1 6 1 near the C-terminus. 
This disulphide bridge is essential to the formation of a 
stable and functional cytokine.^ In contrast, whereas Cys29 
is invariant in all the species we examined, Cys33 is present 
only in the primates, sheep, pig, and cat, but not in the two 
rodents, in which there is a proline. The lack of functional 
importance of this short disulfide loop is underscored by 
muteins, reported by Boissel et al,'' in which tyrosine sub- 
stitutions at each or both of these sites have no effect on 
Epo's biologic activity." 




Mouse Rat Cat Dog 




Human Monkeu Moutt Rat Cat Oog Shaap Pig 



There is also \00% conservation of all three asparagine 
residues responsible for N-linked glycosylation. Even 
though Epo deprived of its carbohydrate either by enzy- 
matic cleavage^* or by production in bacteria-*^ has full in 
vitro biologic activity, survival of the hormone in the circu- 
lation depends on N-linked glycosylation.^^ In contrast, the 
O-linked glycosylation site (Serl26 in -human Epo) is not 
essential for biologic function as it is missing in mouse and 
rat Epos. 

As shown in Fig 6A, there is very high conservation of 
sequence in regions that, by algorithms based on primar>' 
structure, are predicted to be a helices. " The few differences 
in sequence within these helical regions are conservative 
replacements. Indeed, some sites within these predicted a 
helices are conserved in corresponding helices of other cyto- 
kines.^ In contrast, regions predicted to be inierhelical 
loops are less well conserved, particulariy the 14 residue 
stretch between residues 1 16 and 130 of human Epo. In a 
companion report," we show that a mutein with a deletion 
of residues 1 1 1 through 1 19 has normal stability and bio- 
logic activity, whereas a deletion of residues 122 through 
1 26 fails to produce a detectable protein, probably owing to 
markedly impaired stability. 

Similar overall 4 a-helical bundle structure exists or is 
predicted for several other cytokines/hormones, such as 
growth hormone (GH), granulocyte-macrophage colony- 
stimulating factor (GM-CSF). interleukin-3 (IL-3), IL-4, 
and IL-5.^^ Like Epo, GH exhibits a high degree of primary- 
sequence conservation and biologic cross-reactivity be- 
tween mammals. The mouse, rat, pig, and sheep proteins 
show about 80% of amino acid identity with human GH. 
On the contrary, in other cytokines there is more sequence 
diversity among species. For example, the human and mu- 
rine forms of IL-5, GM-CSF, and IL-3 show, respectively, 
only 69%, 54%, and 26% amino acid sequence identity. Fur- 



Fig 7. Phylogenetic Kneages derived from analyses of mamma- 
lian Epo cDNA sequences encoding the full-length mature proteins 
from seven species. (A) Strength of groups in the maximum parsi- 
mony tree found on examining aJI 945 unrooted trees formed by 
seven terminal taxa. This tree of lowest length required 374 base 
substitutions. Each link between ancestral nodes has a circled num- 
ber; this strength of grouping number is the minimum number of 
substitutions that must be added to the length of the maximum 
parsimony tree to find a tree that breaks down the barrier (nrK>ves 
one or more sequences) between the two groups separated by the 
Interior link, (B) The phylogenetic tree derived from the maximum 
parsinumy reconstruction. On the basis of other niolecular evi- 
dence involving comparative amino add sequence data from mono- 
tnme; marsupials, and many eutherian species.**-'* the root of 
this Epo phylogenetic tree is placed on the Interior link separating 
the artiodactyl group from the primate, rodent and cat group. The 
numbers on the Intemodal links represent the numbers of base 
pairs by wNch the nodal ancestral and descendant sequences 
differ. MYA, millions of years ago* as inferred from paleontok>gic 
views of eutherian phytogeny. (C) Parsimony reconstruction on that 
portion of the cDNA sequences that are codons for amino adds. 
The numbers shown as a fractkxi on each link are. In the nunr>era- 
tor. the number of amino add changing base replacements and. in 
the denominator, the number of silent base replacements. The 
computer algorithm that performed this calculotkm is described in 
Czekisnlak «t al** (see the Fig 2 legend of this reference). 
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Fig 8. Phylogenetic lineages derived from analyses of the man^- 
mallan Epo intron 1-exon 2 sequences. (A) Strength of groups in 
the maximum par8tnK)ny tree found on examining all 135,135 
trees fomfied by nine terminal taxa. As the two feloid sequences 
(those of cat and lion) are nearly identical, they were treated as a 
single taxon in this strength of grouping arialysts. The nrwximum 
parsimony tree for the segnr>ent (about 285 bp) of intron 1 and exon 
2 from 10 species required 286 base substitutions. The circled 
nun^rs are the strength of grouping numbers (as defined in the 
Fig 7 A legend). (8) The near most parsimonious tree that groups 
horse with the a rtiodactyl -cetacean branch. This tree required 287 
base substitutions. The numbers on links represent numbers of 
base pairs by which the nodal ancestral and descendant sequences 
differ. 



thermore. unlike Epo or GH, those proteins lack interspe- 
cies biologic cross-reactivity. 

Expression of the mammalian Epo hormones. Mam- 
malian Epo cDNAs were subcloncd into the pSG5 plasmid 
and transiently expressed in the monkey Cos? cell line. Su- 
pematants of the transfected cells were able to sustain the 
cellular proliferation of the murine Epo-dependent HCD57 
cell line, showing the biologic cross-reactivity between spe- 
cies. As the human Epo antiscra used in our radioimmuno- 
assay bind to Epos from other species with variable affinity, 
the amount of produced protein was difficult to determine 
accurately. Experiments arc in progress to further character- 
ize the relative affinities of the various mammalian Epos 
toward murine and human Epo receptors. 

Phylogenetic analyses of Epo sequences. Nucleotide se- 
quences encoding the fulHcngth mature Epo protein were 
analyzed by the maximum parsimony method^' An all 
trees set of computer programs determined the parsimony 
length of each of the 945 unrooted trees formed by the seven 
cDNA sequences (the terminal taxa) (Fig 4) and, on order- 
ing those 945 trees according to increasing length, identified 
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the minimum number of extra nucleoiide subslitutions 
needed to break up each group in the maximum parsimony 
(lowest length) tree. As shown in Fig 7A and B, the mouse 
and rat are strongly grouped, as are the human and monkey. 
The dog and cat are not as strongly grouped, and the pig and 
sheep are very weakly grouped. These grouping results are 
in accord with accepted phylogenetic views, which place the 
intraordinal divergence, on the one hand, between dog and 
cat and, on the other hand, between pig and sheep at about 
50 to 60 million years ago. As expected with the parsimony 
reconstruction on only the codon sequences, silent base 
substitutions occurred much more readily than did amino 
acid replacements (Fig 7C). 

The Epo intron 1-exon 2 sequences shown in Fig 3 were 
also analyzed by the maximum parsimony method. Figure 
8 A shows strength of grouping results based on all 135,1 35 
unrooted trees formed by nine terminal taxa (in this case, 
the cat and lion sequences, which are nearly identical, were 
treated as a single taxon). Because of the quite limited phy- 
logenetic information in these relatively short sequences, 
there are alternative maximum parsimony trees (three were 
found) and the consensus of these alternative trees does not 
depict (Fig 8A) a consistently bifurcating tree. Nevertheless, 
the rodents (mouse and hamster) are strongly grouped, as 
are the feloids (cat and lion); at more moderate strength, the 
cetacean (dolphin) is grouped with the artiodactyls (in partic- 
ular, with sheep). There are other molecular data as well as 
some paleontologic data that depict cetaceans originating 
from early artiodactyls.^*^^*^' Figure SB shows a near maxi- 
mum parsimony tree. This tree has a length of 287 base 
substitutions, whereas the length of each maximum parsi- 
mony tree is 286 base substitutions. A maximum parsi- 
mony tree can be obtained by exchanging positions of the 
horse and human branches, thus placing the human rather 
than the horse close to artiodactyls. However, the grouping 
of horse with the artiodactyl-cetacean clade (ie, the near 
maximum parsimony tree) is in better agreement with well- 
supported phylogenetic evidence on mammalian ordinal re- 
lationships than is the maximum parsimony tree. 
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In order to delineate functionally important domains 
in erythropoietin (Epo), we have prepared and tested a 
series of amino acid replacements at 51 conserved sites 
predicted to be on the surface of the molecule. Alanine 
replacements permitted preservation of a-helical struc- 
ture. Wild type and mutant Epo cDNAs were transiently 
expressed at high levels in COSl and COS7 cells. The 
biological activity of wild type and mutant Epos was 
assayed in three Epo-responsive cell types: primary mu- 
rine erythroid spleen cells, the murine HCD57 erythro- 
leukemia cell line, and the human UT7-EPO leukemia 
eel! line. When Arg" on predicted Helix A was replaced 
by Ala, biological activity was substantially reduced, 
whereas replacement with Glu resulted in total loss of 
specific bioactivity. In a similar manner, the mutein 
Arg'*** Ala in Helix C was completely lacking in biolog- 
ical activity, whereas both Ser**** Ala and Leu*** Ala 
had decreased bioactivity. In Helix D, the mutein Gly^* 
-» Ala had markedly decreased bioactivity* whereas that 
of the adjacent lys"* -» Ala mutein was moderately im- 
paired. In contrast, Ala replacements at three nearby 
sites on Helix D (147, 146, and 143) resulted in muteins 
with increased bioactivity. In conclusion, our mutagen- 
esis experiments have identiHed functionally important 
domains on the surface of the Epo molecule, at sites 
comparable with those established for other cytokines. 



Erythropoietin (Epo)^ is a 30.4-kDa glycoprotein hormone 
that regulates red blood cell production (1, 2). Biochemical 
studies (3, 4) led to moleciilar cloning of the Epo gene (5, 6), 
High level expression has enabled recombinant himian Epo to 
be used very effectively in the treatment of anemias. Moreover, 
the experimental use of recombinant Epo has greatly advanced 
our understanding of the molecular mechanisms imderlying 
erythropoiesis. 

Epo is a member of an extensive cyU>\dne family which also 
includes growth hormone, prolactin, interleukins 2 through 7, 
G-CSF, GM-CSF, M-CSF, oncostatin M, leukemia inhibitory 
factor, and ciliary neurotrophic factor (7-9). Although sequence 
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homology is weak, genes encoding these proteins have similar 
numbers of exons and share a clear relationship between in- 
tron-exon boundaries and secondary structure (9). All of these 
cytokines are predicted to fold into a common compact globular 
structure, consisting of four amphipathic a-helical bundles. 
Such theoretical models of the structures of human growth 
hormone (HGH) (10) and ILr4 (11) have been in good agreement 
with the subsequent stnictiores established by x-ray diffraction 
(HGH) (12. 13) or by multidimensional NMR (IL-4) (14, 15). 
Moreover, the crystal structures of GM-CSF (16, 17), G-CSF 
(18), monomeric M-CSF (19), IL-2 (20. 21), and IL-5 (22) are 
also in reasonable agreement with their predicted structures 
(for review, see Ref, 23). 

Epo's high carbohydrate content (39%) and the heterogeneity 
of these carbohydrate chains have thus far precluded the de- 
termination of its three-dimensional structure by either x-ray 
diffraction or by NMR. In order to gain an understanding of 
Epo structure-function relationships, we first determined the 
coding sequences of Epo from six mammals representing five 
different orders, thereby identifying regions of high homology 
that are likely to have structiural and/or functional importance 
(24). We then developed a computer-generated model of Epo's 
three-dimensional structure and have tested it by means of a 
series of scanning deletion muteins, which fully support a four 
a-helical bimdle structure (8, 25) in common with that of the 
other cytokines mentioned above. As described in this report, 
we have exploited this information in the design and testing of 
single amino acid replacement muteins that provide informa- 
tion on functionally important domains of human Epo. 

MATERIALS AND METHODS 

Construction of MTild Type and Mutant Mammalian Expression 
Plasmids—KHEPO FL12. kindly provided by CSenetics Institute (Cam- 
bridge, MA) (5), is an M13 plasmid, containing a 1.4-kilobase pair 
£coRl-£coRI human Epo cDNA insert. A 943'ba8e pair f^RI-Bg/II 
fragment, which includes the complete coding sequence of the wild type 
human erythropoietiTi as well as untranslated regions 216 base pairs 
upstream and 183 base pairs downstream, was inserted into the mam- 
malian expression plasmid pSG5 (Stratagene) (26) and designated 
pSG5-EP0/WT. 

Site-directed Mutagenesis — Site-directed mutagenesis was carried 
out by a modification of the protocol described by Kunkel et aL (27). For 
further details see Ref. 25. Since a 40-80%' mutation yield is normally 
obtained, four to five double-stranded cDNA clones from each reaction 
were sequenced with 7-deaxa-dGTP and Sequenase (U. S. Biochemical 
Corp.) (28) to verify the mutation. In addition, each mutant was 
screened by restriction mapping or, less commonly, by full sequencing in 
order to detect the presence of additional unwanted mutations. 

Production of Wild Type and Epo Muteins in Mammalian Cef/fi— COS 
1 or COSl cells, grown to 40-60% confluence, were transfected with 20 
pg of recombinant plasmid DNA/lO-cm dish using the calcium phos- 
phate precipitation protocol (29). As a control of transfection efficiency, 
in several experiments 2 pg of pCHUO plasmid (Pharmacia Biotech 
Inc.) was co-transfected and 3-galacto8idaBe activity was measured in 
the cytoplasmic extracts. 

Quantitation of lyansiently Expressed Recombinant Epos — The 
amount of secreted protein in the supematants of transfected C0S7 
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Fig. 1. Primary and secondary structure of human Epo. The amino acid sequence of the mature exported protein ia shown by the single 
letter code. Sequence letters in bold are fully conserved among eight mammalian species (24). Each of the three Mlinked glycosylation sites is 
designated by an asterisk. The predicted amphipathic a helices (A, B, C, D) are enclosed in rectangular boxes. Residues predicted to be on the 
surface are underlined. The shaded patches designate residues in the nonhelical regions of human Epo which can be deleted with no significant 
loss of specific bioactivity (25). The amino acid replacement muteins reported in this study are shown above the primary sequence. 



was determined by a radioimmunoassay which utilized a high titer 
rabbit polyclonal antiserum raised against human wild type Epo and 
produced in our laboratory (30). This polyclonal antiserum is expected 
to measure the concentration of Ala replacement Epo muteins more 
accurately than monoclonal antisera. ^^I -Labeled recombinant human 
Epo was obtained from Amersham Corp. (for further details, see Ref 
3 IX In order to obtain accurate Epo concentrations, several dilutior.s of 
each COS cell supernatant were assayed in duplicate. 

Bioassays—The dose -dependent proliferation activities of wild type 
Epo and Epo muteins were assayed in vitro using three different Epo- 
responsive target cells: 1) primary murine spleen cells, following a 
modification of the method of Krystal (32); 2) HCD 57, a murine MEL 
cell line developed by Hankins et al (33); and 3) UT7-Epo, a human cell 
line derived from the bone marrow of a patient with acute megakaryo- 
blastic leukemia (34). After 22-72 h of incubation (depending on cell 
type) with increasing amounts of recombinant Epo proteins, cellular 
growth was determined by [^HJthymidine (New England Nuclear) in- 
corporation (35). For each assay, nine concentrations of Epo were used 
(2,5-50 milliunita/ml). The counts/min in the absence of any source of 
Epo (nonspecific background) was subtracted from each experimental 
point In order to maximize the accuracy of these bioassays, the follow- 
ing controls were employed, (a) In each experiment COS cell supema- 
tanta containing Epo muteins were compared with COS cell supema- 
unts from three separate transfectiona of wild type Epo. (6) Tb correct 
for any nonspecific effect of other substances in the COS cell superna- 
tant on the growth of cells in the bioassay, wild type recombinant hu- 
man Epo (rhEpo) was added to mock-transfected COS cells and to media 
alone and assayed for bioactivity. These specific bioactivity values were 
generally identical to those of wild type rhEpo secreted from the COS 
cells, (c) For muteins with abnormal specific bioactivity, replicate trans- 
fections were done in order to minimize possible variability in the con- 
tribution of nonspecific factors in the COS cell supematants in the 
measurements of biological activity. 

RESULTS 

Our objective is to identify the functionally important do- 
mains of the Epo molecule by preparing muteinB that have 
altered specific bioactivity, *.e. significantly higher or lower bi- 
ological activity per unit mass of protein. Sudi muteins must 
satisfy two criteria: 1) efHcient secretion by COST cells and 2) 
preservation of the overall folding and three-dimensional struc- 
ture, thereby allowing full recognition (with normal or neariy 
normal binding affinity) by the polyclonal anti-Epo antiserum 
that we use in our radioimmunoassay. 

In Older to satisfy these criteria, our strategy employed the 
use of muteins that have only single amino acid replacements. 
For all but a few muteins, we have made alanine substitutions 



in order to preserve a-helical structure (36). A comprehensive 
alanine replacement scan of Epo would require the preparation 
and testing of nearly 166 muteins. Such an undertaking would 
be prohibitively labor intensive and costly. We therefore de- 
vised a strategy to reduce the number of required replace- 
ments, by applying the information we recently obtained and 
reported on mammalian Epo sequences (24), as well as on scan- 
ning deletion muteins (25). 

On the basis of its primary amino acid sequence and disulfide 
bonds, Epo is predicted to have a four-antiparallel amphipathic 
a-helical bundle structure (8, 25) in common with other mem- 
bers of the cytokine family (9). We have shown that deletions in 
nonhelical regions at the N terminus, the C terminus, and in 
the loops between helices result in the formation of Epo pro- 
teins that are readily secreted from the cell and are biologically 
active (25). These regions (Fig. 1) can be ruled out as domains 
essential for function, such as the sites involved in the binding 
of Epo to its receptor. Moreover, residues that are not conserved 
among mammals (Ref. 24 and Fig. 1) are also unlikely to be 
functionally important sites. Our previous scanning deletion 
analysis (25) suggests that the fiinctionally important receptor 
contact sites are likely to reside on the predicted a helices. 
Furthermore, since the amphipathic helices are predicted to 
bind to one another along their hydrophobic surfaces, the bio- 
logically relevant contact sites are likely to be residues pre- 
dicted to be on the external surfaces of the helices (25). These 
considerations greatly restricted the number of mutations 
needed to define critical functional domains of Epo and focused 
our analysis on the conserved surface residues of the four hel- 
ices as well as residues acyoining the helices. In addition to the 
muteins which were selected according to the above criteria, we 
prepared 3 alanine replacement muteins at conserved sites 
predicted to be in the hydrophobic interior: Trp**, Ser^*, and 
Thr****. As expected (and shown in Table I), all of these muteins 
had normal specific bioactivity. 

Pig. 1 depicts the residues excluded on the basis of the three 
above-mentioned criteria as well as the 56 replacement mu* 
teins we have prepared and tested.' Table I presents specific 
bioactivity data on all of these muteins tested in three Epo- 
responsive cell types. In general there was excellent agreement 

' 53 muteins at 51 predicted external sites. 
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Table I 

Bioactiuity of Epo replacement muteins 
The letters in the column on the left designate predicted helices (A, B, 
C, D) or interhelical loops (A-B, C-D). The amino add of wild type 
human Epo, its position from the N terminus, and its replacement 
according to the single amino acid letter code are listed under the 
mutein column. The three bioaasay columns (human UT7 cell line, 
murine HCD57 cell line, and primary mouse spleen eo'throid cells) 
show specific bioactivity of each mutein, expressed as a percentage of 
wild type human Epo bioactivity. with the background COS superna- 
tant alone subtracted from the value. The mean and standard deviation 
of the bioassay values arc listed for n s 3 determinations. The numbers 
within the brackets indicate the number of separate COS cell transfec- 
tions over (/) the number of separate bioassays that were performed, 
NS, not secreted from COS 7 cells. 



Specific bioactivity 



Helix 


Mutein 




HCD 


Spleen 






% 


wild type human Epo 


A 


S9A 


148 ± 35; [3/31 


123 ± 13; (3/31 




A 


RlOA 


91; [Ul] 


123; 1 1/1] 




A 


E13A 


111,74; [2/2] 


80, 99; [2/2] 




A 


R14A 


58, 56; [2/21 


2. 30; [2/2] 


18, 24; 12/2] 


A 


R14L 


91 ± 8; [4/6J 


86 ± 15; (4/6j 


67. 111;[2/2J 


A 


R14E 


11 ± 6 [3/51 


12 - 1; 15/31 


17. 17; [2/2] 


A 


L17A 


110; Wl] 


95; [1/1] 




A 


E18A 


70 ± 17; [3/3] 






A 


K20A 


252 - 81; [4/4] 


103 ± 21; [4/4] 




A 


E21A 


99, 69; 12/2] 


50; [1/1] 




A-B 


C29Y/C33Y 


75; [1/1] 


42; [1/1] 




A-B 


K45A 


110, 101; [2/2] 


100, 100; [2/2] 




A-B 


F48S 


146, 110; [2/21 


100, 61; [2/2] 




A-B 


Y49S 


150 ± 7; [3/3] 


146, 138; [2/2] 




A-B 


A50S 


128, 92; [2/2] 


140, 37; [2/2] 




A-B 


W51S 


101, 78; [2/2] 


89 Jt 7; [3/3] 




A-B 


K52S 


99, 90; [2/2] 


99, 88; [2/2] 




B 


Q59A 


156, 125; [1/2] 


130. 134; [1/2] 




B 


E62A 


101, 85; [2/2] 


63. 78; [2/2] 


82, 111; [2/2] 


B 


W64A 


102, 112; [2/2] 


87, 143; [2/2] 


90, 107; [2/2] 


B 


Q66A 


96; [1/1] 


140; [1/1] 


99; [yi] 


B 


G66A 


130; [Ul] 


103; [yi] 


85; [yi] 


B 


L69A 


94; [yi] 


84; Wl] 




B 


S71A 


64; [1/1] 


130; Wl] 


85; [yi] 


B 


E72A 


NS 






B 


A73G 


242 a: 25; [2/3] 


104; [1/1] 




B 


R76A 


109; [1/1] 


138; [yil 




C 


Q92A 


112; [1/1] 


98, 95; [2/2] 


113, 91; [2/2] 


C 


L93A 


123, 126; [2/21 


127, 95; [2/21 




C 


H94A 


NS 






C 


D96A 


NS 






C 


K97A 


84 ± 9; [2/4] 


77 a: 27; [2/3] 


94; [yi] 


C 


SIOOA 


98, 85; [2/2] 


131; [yi] 


104; [yi] 


C 


GIOIA 


166 * 52; [3/3] 


146 ± 26; [3/4] 


73, 152; [2/2] 


C 


R103A 


3 ± 3; [3/51 


7 ± 1; [2/33 


6; [yil 


C 


S104A 


114 ± 28; (2/41 


35 ± 17; [2/3] 




C 


T106A 


120; [Vl] 


108; (yij 




C-D 


L108A 


158 * 41; (2/41 


48 ± 6; [2/41 


12, 52; [2/2] 


D 


T132A 


120, 104; [1/1] 


98, 108; [1.1] 




D 


D136A 


108, 114; 12/2] 


88 ± 20; 13/31 




D 


R139A 


171, 91; [2/2] 


120. 109; [2/21 


90. 131; [1/2] 


D 


K140A 


189 ± 30; [3/3] 


110 ± 19; [3/3] 




D 


R143A 


276 ± 86; [3/3] 


150 ± 27; [3/3] 




D 


S14eA 


198, 269; [1/2] 


58, 110; [1/2] 




D 


N147A 


457 ± 185; [3/6] 


116 ± 22; [6/6] 




D 


R150A 


181 A 94; [3/4] 


92 ± 28; [3/4] 


28. 64; [1/21 


D 


G151A 


11 ± 4; [2/3] 


6 ± 2; [2/3] 




D 


K152A 


89, 73; [1/2] 


46, 20; (1/2) 


16, 35; (1/21 


3*D 


L153A 


95; Wl] 


83; [Vl] 


46; (y2] 


3'D 


K154A 


176 ± 60; (3/4) 


185 A 84; [3/5] 


82, 157; [1/2] 


3*D 


L15SA 


95, 128; [2/2] 


U5;{yi] 


40. 98; (2/21 


3'D 


Y166A 


105; [lyil 


65 ± 12; 13/3] 




3'D 


TI57A 


66, 104; [2/2] 


89. 91; (2/2] 


102, 104; [2/2] 


3'D 


0158A 


84, 109; [2/2] 


94, 89; [2/2] 


77, 139; (2/2] 


3'D 


E159A 


89, 97; (2/2] 


101 ± 10; [2/3] 


33, 96; [2/2] 



between the three cell bioassay systems. As discussed below, a 
few muteins displayed different biological activity depending 
on the species of the assay cell, specifically the human UT7 
versus the murine HCD57 and spleen cells. 

It should be noted that in a few of these dose-response 
curves, at the highest concentrations of Epo tested, there was a 
drop-off in the incorporation of I ^H] thymidine (for example, see 
Figs. 4B and 5C). This is due to the fact that high levels of 
biologically active Epo cause terminal maturation and cessa- 
tion of cell division. Because of this possibly confounding phe- 
nomenon, it is important to have, as in the experiments that we 
report here, a full dose-response cur\'e for each mutein over a 
wide (50-fold) concentration range. 

At three sites in human Epo, we found a marked decrease in 
specific bioactivity. (a) In predicted Helix A, replacement of 
Arg^^ by Ala resulted in considerably impaired biological activ- 
ity, particularly when assayed by the :v.o mouse cell systems 
(Fig. 2). In order to further delineate residue 14 as a function- 
ally important site, the following additional muteins were pre- 
pared and tested: Arg** ^ Leu and Arg'* Glu. As shown in 
Fig. 2, R14L had normal specific bioactivity, whereas R14E was 
nearly inactive, as determined by all three cell bioassays. The 
differential bioactivity of these Arg*^ replacement muteins with 
alanine, leucine, or glutamic acid indicates a precise stereo- 
chemical integrity at this site that is essential for Epo*s func- 
tion. When other external sites predicted to be on Helix A were 
replaced by Ala (Ser», Argl^ Glu". Leu'^ GW\ hys^, and 
Glu^^) normal bioactivity was observed. Particularly notewor- 
thy is the lack of abnormal function when the 3 Glu residues 
were replaced. An acidic residue in Helix A is thought to be 
critical for the binding of IL-2, IL-S, and GM-CSF to their 
respective receptors (23). 

(6) In predicted Helix C, the Arg'**^ — Ala mutein was com- 
pletely devoid of specific bioactivity in all three bioassay sys- 
tems (Fig. 3lA). These result* confirm and extend recent work of 
Grodberg et al. (37) who employed only the mouse spleen cell 
assay (32). Replacement of the adjacent residue Ser*" with Ala 
resulted in normal bioactivity when tested in human UT7 cells, 
but low (35%) bioactivity when tested in mouse HCD57 cells 
(Fig. 3B). In a similar manner, as shown in Fig. 4C, alanine 
replacement of Leu'*'® in the CD loop resulted in a mutein which 
had slightly enhanced bioactivity (1589b) when tested in the 
human cells, but low activity when tested in mouse HCD57 
cells (48%) or in mouse spleen cells (32%). These results indi- 
cate that this region of predicted Helix C contributes strongly 
to Epo's function. In contrast, Ala replacement of other con- 
served residues predicted to be on the sxirface of Helix C (Gin". 
he\i^\ Lys*", Ser^~, Gly***\ and Thr***^) generated muteins with 
normal Epo bioactivity. His** — Ala and Asp^ Ala were not 
secreted from COS cells as determined by radioimmunoassay 
or Western blot analyses and therefore could not be assayed for 
bioactivity. 

(c) In predicted Helix D, the Gly*** — Ala mutein was also 
nearly devoid of specific bioactivity. as assayed in both human 
UT7 cells and murine HCD57 cells. Moreover, the Ala replace- 
ment of the neighboring residue Lys"' had decreased specific 
bioactivity in the two murine cell bioassays but normal activity 
in human UT7 cells. Of additional interest is the increased 
specific bioactivity noted in muteins in which nearby residues 
were replaced by Ala: Asn"'. Ser***, Arg***, and perhaps Lys****, 
Table I). In contrast no significant abnormalities in specific 
bioactivity was noted when Ala replaced the following residues 
in Helix D: Asp^, Arg*", and Arg***. Because the exact C- 
terminal boundiary of Helix D is uncertain, and because pre- 
liminary data from other laboratories (38, 39) suggest that the 
C-termlnal region of Epo may be functionally important, we 
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FiQ. 2, Bioasflays of mutations at Arg" (A, Arg" Ala; Ar^* Glu; and C, Arg" Leu). Epo-dependent HCD57, UT7, and primary 
murine spleen cells were incubated with increasing amounts of wild type (WT) or mutant Epos (muteins). In this, as in subsequent figures, the 
number after WT refers to a particular transfection. WT ave refers to the average of three separate COS cell transfections. The concentrations of 
wild type Epo and Epo muteins were measured by radioimmunoassay. The uptake of [*H I thymidine provides a measure of Epo-induced cellular 
proliferation. Differences in incorporation of radioacti\*ity as shown in the ordinate reflect different doubling times of the three cell types. The 
concentration of Epo in the COS cell supernatants is determined by an Epo radioimmunoassay (see •'Materials and Methods"). In the bioassay 
curves the abscissas represent increasing concentrations of wild type or mutant Epo. Each assay consisted of three replicate measurements of 
[^HHhymidine incorporation at nine concentrations of Epo. The area under the curve {i.e. the sum of these values, corrected for nonspecific 
backgrxmnd incorporation) provides the measure of specific bioactivity used in the compilation of Table I. 



tested alanine replacements at positions Lys*". Leu*", Lys***. 
Leu*", Tyr***, Thr"', Gly*", and Glu'"* which are predicted to 
be in the adjacent nonhelical segment at the C terminus. As 
shown in Table I, Lys'" — Ala had increased bioactivity, 
whereas the other replacements were indistinguishable from 
wild type. 

We examined competition between wild type and mutein 
Epos as shown in Fig. 5. A bioassay in UT7 and in HCD57 cells 
was performed with a constant amount of wild type Epo and 
increasing amounts of competitor mutein Epo. Even when pres- 
ent at 5-fold excess, the biologically inactive Arg**" Ala com- 
petitor Epo mutein failed to affect the specific biological activity 
of wild type Epo, in either cell system (Fig. 5A). In comparison, 
a 5-fold excess of the Arg" Glu mutein resulted in a slight 
enhancement of bioactivity, suggesting that, at high concentra- 
tions, it can bind to the receptor and induce proliferation. As a 
control (Fig. 6C), the Arg** Leu mutein, which has nearly 
normal specific bioactivity, when added in 5*fold excess, gave 
the expected robust increase in proliferation. 

At other sites predicted to be on Epo's surface. Ala replace- 
ments had no significant effect on biological activity. Lo^s^^ -* 
Ala in interhelical region A*B resulted in a mutein with wild 
type Epo bioactivity. Serine substitutions at Phe^, Ala", TVp", 



and Lys" also resulted in muteins with wild type Epo bioactiv- 
ity. whereas the lyr*® -*■ Ser mutein had 150% of wild type Epo 
bioactivity in both UT7 and HCD57 cells. As shown on Table 1, 
Ala replacements of conserved residues predicted to be on the 
surface of Helix B (Gln«, Glu«, TVp", Gln« Gl/*, Uu®, Ser", 
and Arg'®) had normal specific bioactivity. Ala^ Gly had 
increased bioactivity in UT7 cells. Glu" Ala was not secreted 
by COS cells as determined by radioimmunoassay and Western 
blot analysis (results not shown) and therefore could not be 
assayed for bioactivity. 

DISCUSSION 

Initial studies of Epo's structure-function relationships re- 
lied primarily on the use of antibodies (40-49). Mapping of 
specific domains has been based on polyclonal antibodies raised 
against Epo peptides and monoclonal antibodies raised against 
either peptides or the intact protein. However, it cannot be 
assumed that antibodies which neutralize Epo's biological ac* 
tivity always bind to a functionally important domain such as 
the receptor binding site. Antibodies that bind to a functionally 
irrelevant site may impair bioactivity by either steric hin- 
drance or the induction of a conformational shift. Reports that 
antibodies to peptide 111-129 (42, 48) and to the C-terminal 
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Fig. 3. Bioassays on mutations in Helix C that have abnormal function. Arg'*" 
Ala (A), Ser^" Ala (J5). and Lcu***^ Ala (C) were assayed in the cell types as described 
in the legend to Fig. 2. 
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peptide 152-166 (47) neutralize Epo function, must be viewed 
with caution in view of our demonstration that deletions at 
these sites; AllO-119. A163-166 (25) and Ala replacements at 
residues 152-159 (Table I) fail to affect biological activity. 

In contrast, several (40, 41, 45. 47), but not all (43). investi- 
gators have reported that biological activity is not impaired 
when Epo is bound to antibodies directed against N-terminal 
peptides. These results must be reconciled with our finding 
that replacement of Arg" with Ala or GIu, diminishes or abol- 
ishes Epo biological activity, respectively. Arg" is predicted to 
be the fourth residue in a-Helix A, and therefore, the epitope in 
this portion of the N terminus is likely to differ substantially 
from the unordered structure of the peptide immunogen. 
Sytkowski and Fisher (41) reported that a monoclonal antibody 
raised against the 26-residue N-terminal peptide of human Epo 
bound to some animal Epos (rat) but not others (sheep and dog). 
All of these animals share an identical N-terminal sequence 
through position 15 (24). Thus, this monoclonal antibody is 
likley to bind an epitope C-terminal to Arg*^. 

There is much less information in the literature on the prep- 
aration and testing of site-directed mutants of Epo. Initially, 
attention focused on the distilfide bonds (50) (see also Ref 25) 
and N-linked glycosylatioa sites (61). Quelle et al (38) prepared 
muteina in which 6--8-re8idue cassettes were inserted into the 
N and C termini of Epo. The former had normal specific bioac- 
tivity in agreement with our recent data (25, 52), shovdng 
nearly normal specific activity of Epo in which residues 2-5 
were deleted and of Epo containing the propeptide at the N 



terminus. The cassettes which Quelle et al. (38) inserted into 
the C terminus resulted in muteins that were biologically in- 
active. In contrast, as mentioned above, we have shown that 
muteins containing deletions of the 4 C-terminal residues 
(A 163-166) or replacement of those amino acids by a Lys-Asp- 
Glu-Lys (KDEL) or polyhistidine sequence, all had full specific 
bioactivity (25). 

Immunochemical studies led Sytkowski and his colleagues 
(42, 48) to conclude that a functional domain lay within resi- 
dues 99-129. They then prepared a series of muteins in which 
Glu-Phe was inserted in place of sequential deletions of 3 ad- 
jacent amino acids (53). Muteins in the region 99-110 were not 
secreted from C0S7 cells and, when transcribed and translated 
in vitro, were not biologically active. Recently, this group has 
reported on Ala replacements of residues 100-109. In full 
agreement with our results, they found that, with the mouse 
spleen cell assay, Arg^*' — Ala was devoid of biological activity, 
whereas Ser'*** Ala and Leu**** -* Ala had decreased bioac- 
tivity. In contrast, our earlier (25) and current results differ 
almost totally from the recent report of Bittorf al. (39) on four 
deletion muteins and four Ala replacement muteins. They ob- 
tained the following specific bioactivities: Arg*** Ala. 47%; 
Thr**» - Ala, 41%; - Ala, 9%; and Glu"» - Ala. 26%. 

whereas we obtained values of 6, 114, 160, and 86%. respec- 
tively, for these muteins. EquaUy surprising is their finding 
that A13-17, predicted to be in Helix A, was secreted and had 
normal biological activity, whereas we found that A12-16 was 
not secreted and devoid of bioactivity (25) and that replace- 
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ments of Arg** markedly impair bioactivity. These investiga- 
tors (39) employed only the human Epo-sensitive cell line 
TF-1. which in our hands is unstable and an unreliable assay 
system for muteins. 

Our strategy for designing replacement muteins for studying 
Epo*8 structure-function relationships is based a three-dimen- 



sional model, reinforced by scanning deletion mutein experi- 
ments (25) and mammalian primary sequences (24). As sum- 
marized in Table I, we have tested mutations at 54 residues 
selected by the exclusion criteria described above. The repro- 
ducibility, validity, and interpretability of these experiments 
has been considerably strengthened by the use of three bioas- 
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Fi.;. tS. Model of predicted three-dimonsional structure of Epo 
showinjj a proposed functionally important domain that encom- 
passes Arg'^ on Helix A, Arg'*^ and Ser'"^ on Helix C, and another 
that encompasses GIv"', Lys'*^, and perhaps Asn'^'. Scr'"^. Arg'^\ 
and Lys"". 

.say systems, a human cell line, a mouse cell line, and mouse 
spleen cells. 

A number of the muteins that we have tested (R14A, K20A, 
A73C.. S104A, L108A. K140A, R143A, S146A. N147A, K152A.i 
.siiowcd higher specific bioactivity with the human cells than 
with \\\Q mou.sc cells. It in likely that subtle differences in the 
sit uctures (if the human and mou.se receptors cooperate with 
small di (Terences in binding affinity to reveal a functionally 
abnormal mutein. The competition experiments shown in Fig. 
5 suggest that the two muteins with the lowest bioactivity 
(Arg** ' Olu and Arg'"' • Ala) fail to prevent binding and 
activity of wild type Epo. This experiment also rules out any 
nonspecific suppression by these muteins or the COS cell su- 
pernatants of the growth of the Epo-dependent assay cells. 

Taken together our results show that a restricted number of 
re.sidues on Helices A, C, and D are involved in Epos biological 
activity. Wo propose the existence of one functional domain 
' depicted in Fig. 5) encompassing Arg'* of Helix A, Arg'"'^ and 
Ser'"^ of Helix C, and perhaps Leu'*^ in the predicted C-D loop. 
In the three-dimensional model (25) (Fig. 6), Arg*^ is predicted 
to be within close proximity of Arg"^ and Ser'*", Moreover, 
mutagenesis and structural studies on GH, GM-CSF, and IL-4 
have demonstrated a functionally impoi-tant patch, presumably 
receptor binding, that encompasses portions of Helix A and 
Helix C «23). We propose a second functional domain (also de- 
picted in Fig. 5) in Helix D at Gly"" and perhaps Lys'"", where 
alanine replacements show decreased bioactivity.' Our finding 
of enhanced bioactivities of Ala replacements at nearby resi- 
dues ( Arg'^\ Ser**^ and Asn"') is more difficult to interpret but 
may indicate an extension of this functional domain. Additional 

^ It is possible that this functional patch also includes a portion of 
Helix A. analogous to one of the receptor binding domains in GH (13). 
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amino acid ri? placemen is in the vicirv.ty (»t' llu'.se re.>idLa's in 
Hi;lic(.'ri A. (.'. and D shduld provide fimht'i- i-esol nt.ion of our 
pnjposod Cunclional doniain.s. We are parlieulai ly inleru.^ted in 
producing mlltein^; in which a comhir.r-.-.ion of .selected rephue- 
ments result in a maximal enhancemeni of bioactivity, perhaps 
enabling the development of an imprrAed therapeutic agent. 
Further experiments are also in progres.s to determine if the 
ohser\'ed alterations in biological activity of selected muteins 
can, like GH l54). be explained by parallel changes in the bind- 
ing afimity to the Epo receptor. 

Our structure- function studies suggest that Epo binds to its 
receptor at the two distinct domains discussed above. Our as- 
signments are consistent with studies on site-directed muteins 
of other cytokines, including GH, IL-2. IL-3; IL-4. lL-5. and 
GM-CSF f23 1, which show that residue? on Helix \ and Helix D 
play a c)-itical role in biolog-ical activi:y. X-ray difTractinn anal- 
ysis ijf tile GH CIl receptor complex l :.' shuu.s tlial dislinrl 
domains on Helix A and Helix D of GH each bind to a separate 
receptor molecule, thereby facilitating dinierization of the re- 
ceptor, perhaps triggering signal transduction. 
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Mappiig of the Active Site of Recombinant Human Erytfropoictin 

By Steve Elliott, Tony Lorenzini, David Chang, Jack Barzilay, and Evelyne Delorme 



Recombinant human erythropoietin (rHuEPO) variants have 
been constructed to identify amino acid residues important 
for biological activity. Immunoassays were used to deter- 
mine the effect of each mutation on rHuEPO folding. With 
this strategy, we could distinguish between mutations that 
affected bioactivity directly and those that affected bioactiv- 
ity because the mutation altered rHuEPO conformation. Four 
regions were found to be important for bioactivity: amino 
acids 11 to 15, 44 to 51, 100 to 108, and 147 to 151, EPO 

ERYTHROPOIETIN (EPO) is a hormone produced pri- 
marily by the kidney. It is involved in the growth and 
mjitviration of er^'throid cells from precursors. Decreased 
production of EPO due to kidney failure results in anemia. 
The human EPO gene has been cloned,''^ and the hormone 
has been produced by recombinant DNA techniques (recom- 
binant human EPO [rHuEPO]). The human gene encodes a 
1 65 -amino acid mature protein. The secreted protein con- 
tains approximately 40% carbohydrate and has an approxi- 
mate molecular weight of 30 kD.-^ The carbohydrate has been 
shown to have an effect on protein stability and solubility, 
but it is not required for in vitro bioactivity.'*'^ This indicates 
that the active site is retained in the protein portion of the 
molecule. 

The mechanism by which EPO stimulates erythropoiesis 
is partly understood. EPO binds to specific cell-surface EPO 
receptors. Activation of the EPO receptor triggers intracellu- 
lar signaling events including phosphorylation of the recep- 
tor, followed by activation of the JAK-STAT, RAS, and PI3 
kinase pathways.*"*' These signaling pathways trigger cells 
to undergo proliferation and differentiation and to block 
apoptosis. Cloning of the murine and human EPO recep- 
(Qj.gi2.i3 ^j^jj igjgj. identification of a constitutively active re- 
ceptor mutant with an Argl29 to Cys mutation'^ '^ led to the 
proposal that dimerization of the receptor is responsible for 
its activation. This proposal was supported by the observa- 
tion that bivalent monoclonal antibodies but not monovalent 
Fab fragments could activate the EPO receptor in the absence 
of EPO.'*- 

If activation of the EPO receptor is indeed due to homodi- 
merization, then EPO must be capable of simultaneously 
binding two receptors. The fact that rHuEPO exists in solu- 
tion as a monomer suggests that it must have two receptor 
binding sites much like that of growth hormone. !n this case, 
a growth hormone monomer activates its receptor by cross- 
linking two growth hormone receptors." '^ However, for 
EPO, the location of the two binding sites has not been 
definitively identified. 

Several strategies have been used to try to identify the 
active site of EPO. These studies included mapping of neu- 
tralizing antibody epitopes^" and analysis of the effect of 
deletions"'" or point mutations^'-"'" on bioactivity. These 
studies implicate several regions that are important for bioac- 
tivity in EPO, Arg 14. residues between 100 and 110, and 
some residues near the C-iemiinus. Interpretation of the re- 
sults of these experiments is difficult, because the crystal or 
nuclear magnetic resonance structures of EPO have not been 
reported. In addition, interpreting mutagenesis results is 



variants could be divided into two groups according to the 
differential effects on EPO receptor binding activity and in 
vitro biologic activity. This suggests that rHuEPO has two 
separate receptor binding sites. Mutations in basic residues 
reduced the biologic activity, whereas mutations in acidic 
residues did not. This suggests that electrostatic interac- 
tions between rHuEPO and the human EPO receptor may 
involve positive charges on rHuEPO. 
© 7357 by The American Society of Hematology. 

complicated by the difficulty of distinguishing between 
changes in bioactivity caused by changes in the active site 
and changes in protein folding that indirectly affect bioactiv- 
ity. The effect of mutations on folding of EPO variants was. 
not established in previous studies, except for an Arg 103 to 
Ala mutation.^' 

We wished to accurately map the active site of rHuEPO 
and determine whether one or two binding sites could be 
identified. We constructed 141 different rHuEPO variants 
and tested them for biologic activity. We have previously 
reported the development of an immunologic approach that 
allowed testing of the effect of mutations on rHuEPO confor- 
mation.^"'^' Analysis of the effect of mutations in rHuEPO 
on its immunoreactivity with this method has been used 
previously to identify buried residues in rHuEPO and those 
important for the rHuEPO structure. With this strategy, we 
have identified mutations that are either predicted to be on 
the exposed surface of rHuEPO or only modestly affect the 
rHuEPO structure. Thus, we have identified residues that 
appear to be part of the rHuEPO active site. According to 
models that suggest that rHuEPO is a four-helical bun- 
2U.27J3.34 jj^g residues important for bioactivity can be lo- 
calized to two different regions. This suggests that rHuEPO 
has two different binding sites. 

MATERIALS AND METHODS 

Constmction and expression of rHuEPO variants. The rHuEPO 
cDNA was obtained from COS-1 cells transformed with a human 
rHuEPO genomic clone.-*" The coding region was recovered as an 
810-nucleotide BsrEW-Bgl II DNA fragment and was cloned into an 
SV40 vector.** The vector contained a dihydrofolate reductase 
(DHFR) gene for selection in Chinese hamster ovary (CHO) DHFR- 
cells. All the variants were expressed in COS-1 cells. Transfection 
was by cither the calcium phosphate method^ or electroporation.^° 
Conditioned medium was collected after 3 to S days. Aliquots were 
made and stored at -10°C. In cases for which larger amounts of 
material were needed, stable CHO cell transformants were made. 
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CHO ceMs were transfcctcd by the calcium phosphate method, and 
transfoniiants were selected for DHFR expression. 

To construct rHuEPO variants, in vitro oligo-dirccted mutagenesis 
was pcrfonncd.^" In brief, a DNA segment containing human EPO 
coding sequences was transferred to M 13mpI8, and single-stranded 
DNA was produced in a tint ung Escherichia coli strain (RZ1032). 
The single-stranded DNA was annealed with synthetic oligonucleo- 
tides containing the desired mutations, and was then extended with 
Klenow polymerase in the presence of T4 DNA ligase. The DNA 
was then transfcctcd into E. coli strain JMI09. Variants with the 
desired mutations were identified by differential hybridization using 
the mutant oligonucleotide. The presence of the mutations was con- 
firmed by DNA sequencing, and then the DNA segment was trans- 
ferred to the expression vector. 

.'ina lysis of rHuEPO expression. The rHuEPO concentration 
( units per milliliter) in conditioned inedium for each rHuEPO variant 
was detennined by four to six different nnmunoassays. 1 he EPO 
standard used in the assays was rHuEPO produced in CHO cells. 
'"I-EPO was obtained from Amersham (Arlington Heights, IL). The 
isolation and characterization of antibodies and immunoassays has 
been described elsewhere.^"*^' These immunoassays included a radio- 
immunoassay (RIA) using an antibody raised to purified rHuEPO 
(RIA-P); a RIA using an antibody raised to a synthetic peptide 
corresponding to the first 20 amino acids of rHuEPO (RIA-N); and 
enzyme-linked immunosorbent assays (ELlSAs) using two different 
monoclonal antibodies that recognize different rHuEPO epitopes 
(EIA-DIl and EIA-FI2). The ELISA format involved capture of 
rHuEPO by the monoclonal antibody, and then an incubation with 
one of two different signal polyclonal antibodies raised to rHuEPO. 
Two different polyclonal signal antibodies raised in goats or rabbits 
were available. The RIA-N immunoassay is largely unaffected by 
rHuEPO conformation. In addition, the epitopes recognized by these 
immunoassays have been detcmiined.^° Therefore, it was possible 
to determine the rHuEPO concentration in samples by selecting 
assays that were unaffected by the introduced mutation. Each 
rHuEPO variant was also assayed with the monoclonal antibody 
9G8A (RI A-9G8A). The specific 9G8A immunoreactivity was deter- 
mined as units of 9G8A immunoreactive EPO divided by total units 
of immunoreactive EPO as determined earlier 

Increased immunoreactivity measured by RIA-9G8A is indicative 
of altered folding. Completely denatured rHuEPO yields an immuno- 
reactivity ineasured by this antibody that was 20- to 40-fold higher 
than expected.^' In addition, immunoreactivity appears to increase 
in proportion to the degree of unfolding," Immunoreactivity mea- 
sured by this assay that was 2.5 times higher than expected was 
considered indicative of altered folding. 

Purification of rHuEPO variants. Conditioned medium from 
CHO cells expressing the various rHuEPO variants was purified as 
previously described. 

In vitro bioassays. In vitro bioactivity was 'determined by mea- 
suring thymidine uptake in a factor-dependent cell line. 32D,^*-" that 
had been made dependent on EPO for growth by introduction of a 
murine EPO receptor gene (32D + EPOR*°). Assays were performed 
as previously described*" with the following modifications. 320 cells 
were factor-starved for 3 to 5 hours in assay medium (RPMI 1640 
medium; GIBCO BRL. Grand Island, NY) containing 10% fetal 
bovine serum and 1% Glutamine Pen-Strep (Irvine Scientific, Santa 
Ana, CA) lacking EPO. Test samples or EPO standard (rHuEPO) 
were added to wells in a 96-wcU microtitcr plate; 50 fiL starved cells 
were then added ( 1 5,000 ccIIs/wcll). and the plates were incubated in 
a humidified incubator at 3TC and 6% CO2. After 18 to 24 hours, 
50 tiL methyI-*H-thymidinc (I mCi/mU 20 Ci/mmol) diluted 1:100 
in assay medium was added. Plates were incubated for an additional 
4 hours at 3TC and 6% COi- Labeled cells were harvested onto 
glass fiber filtennats using a PHD cell harvester (Cambridge Tech- 



nology Inc, Watcrtown. MA) and dcionizcd water as a washing 
solution. Fillers were rinsed a final time with 2-propanol, and then 
were dried and counted in a Beckman (rullerton, CA) model 
LS6000IC scintillation counter. Biologic activity was expressed in 
units using rHuEPO as the standard. The specific bioactivity was 
units of bioactive EPO divided by units of immunoreactive EPO. 

Receptor binding. Receptor binding activity was determined by 
cold-displacement assays with high-specific activity '"l-EPO (Am- 
ersham) on a human cell line (OCIMT'). Cells were grown to 2 to 
5 X lOVmL. collected by centrifugation. washed two times in bind- 
ing buffer (RPMI 1640 mcdium/lVo bovine scrum albumin/25 mmoV 
L HEPES, pH 7.3), and then resuspended at 1 to 2 X lOVmL in 
binding buffer containing 0.1% azide and 10 /ig/mL cytochalisin B. 
Cells were incubated with varying amounts of sample and '^'1-EPO 
at 37**C. After 3 hours, cells were centrifuged through phthalate oil 
(60;40 vol/vol dibutyl.dinonyl phthalate 1. The tubes were then quick- 
Irozen in diy ice ethanoi, the pellets were clipped, and liie amount 
of bound '"1-EPO was detennined by counting in a gamma counter. 
Receptor binding activity was determined by comparison of the 
displacement by the rHuEPO variants to a cold-displacement stan- 
dard curve generated with a rHuEPO standard. Values were thus 
relative to the standard and are reported in units per milliliter. Spe- 
cific receptor binding activity was determined by dividing by the 
amount of immunoreactive EPO detennined by immunoassay. 

RESULTS 

Construction and analysis of rHuEPO variants. We 
have previously constructed over 200 difTerent rHuEPO mu- 
tations. These have been analyzed to determine which ones 
affect rHuEPO structure.-" We have also used immunologic 
data to predict residues thought to be on the solvent-exposed 
surface of rHuEPO. Thus, we selected from this group 141 
individual mutations with substitutions at 1 18 of 165 amino 
acid positions in rHuEPO and tested them for in vitro biolog- 
ical activity. At several positions, multiple substitutions were 
made and the rHuEPO variants were assayed. Thus, the addi- 
tional mutations would confirm the importance of the posi- 
tion for bioactivity and provide information on the amino 
acid requirements at thai site. 

Each variant was transiently expressed in COS-1 cells, 
and conditioned medium -was collected. In several cases, 
stable expression was obtained in CHO cells to produce 
larger quantities of rHuEPO variants. To perform the analy- 
sis on a large scale, most of the variants were not purified. 
Instead, each supernatant was assayed by up to six difTerent 
immunoassays. Therefore, it was possible to determine the 
concentration of variants in the cell-conditioned medium us- 
ing the assays that were unaffected by the introduced muta- 
tion. The effect of each mutation on rHuEPO conformation 
was also estimated with RIA-9G8A. The immunoreactivity 
measured by this assay increases when rHuEPO is dena- 
tured.' ' 

Figure 1 shows results of the in vitro bioassays that mea- 
sured thymidine uptake in the EPO-dependent cell line 32D 
-♦- EPOR. Figure 1 also shows the effect of each mutation 
on folding according to the results of RIA-9G8A immunoas- 
says. Several amino acid segments in rHuEPO could be mu- 
tated without affecting bioactivity. These included residues 
21 to 44, 52 to 95, and 109 to 140. Thus, these regions do 
not appear to play a major role in bioactivity. In addition, a 
termination codon introduced at Thrl63 also had no apparent 
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IN VITRO ACTIVITY (THYMIDINE UPTAKE) 



Fig 1 . Mutagenesis of rHuEPO. 
Amino acid residues in the 
rHuEPO sequence (black drdes 
with white letters) and rHuEPO 
variants (acQacent to rHuEPO se- 
quences with shaded symbols) are 
indicated. 0% substitution of a ter- 
mination codon, TGA. The shape 
of the symbols indicates the effect 
of the mutation on MA-SGSA in> 
munoreactivity. The shacfing code 
represents the effect on in vitro 
bioactivity. 



O >70% Active 

O 20-70% Active 

O <20% Active 

• <2% Active 



O 9G8A<2.5X 
n 9G8A 2.5-10 X 
O 9G8A>10X 




effect on bioactivity. which suggests that the C-terminal resi- 
dues are not important for bioactivity. Introduction of a pro- 
line at -2 relative to the amino-terminal Ala affects cleavage 
of the signal peptide, resulting in a three-amino acid exten- 
sion (Val Pro Gly^"*^*^). In vitro activity and folding mea- 
sured by RIA-9G8A with a Pro-2 variant was normal (data 
not shown), which suggests that the amino terminus is not 
important for bioactivity. This variant has an altered amino 



terminus as evidenced by a lack of immunoreactivily with 
antibodies that bind to the amino terminus.^" Forty-three 
mutations at 36 positions did have an affect on bioactivity. 
These mutations were concentrated in four regions: residues 
10 to 20, 44 to 51, 96 to 108, and 142 to 156. Mutations 
that reduced the bioactivity by over 50- fold were found in 
all four regions. Mutations at several other scattered posi- 
tions also reduced bioactivity. These included the mutations 
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Gln59, Glu62, Lcu67, and Leu70 and disulfide bonds. These 
latter mutations also affect folding and/or are predicted to 
be in buried positions.'" Thus, the mutations in these residues 
may affect bioactivily indirectly. 

All rHuEPO variants in Fig I were also tested for the 
ability to stimulate fonnation of erythroid bursts using ery- 
throid precursors from human bone marrow. The results were 
qualitatively similar to those in Fig 1 (data not shown). 
Mutations in all the residues that reduced thymidine uptake 
in 32D + EPOR also reduced erythroid burst formation. In 
addition, variants identified as having the greatest reduction 
in thymidine uptake activity also had the greatest reduction 
in erythroid burst formation. The fact that human cells were 
used in this assay suggests that the residues identified as 
being important for activation of 32D - EPOR (murine cells) 
are also important for activation of human cells. Thus, the 
mutations in the putative active site affect both proliferation 
and differentiation and are important in both species. 

Role of Tyr and acidic and basic residues in bioactivity. 
It has been reported that chemical modification of tyrosines 
can inactivate rHuEPO/'*-^* To see if we could identify which 
tyrosines were responsible for this effect, each of the four 
Tyr residues were individually mutated. Mutations at Tyr49 
or Tyrl45 had no effect on bioactivity (Fig I). The Tyrl5 
and Tyr 1 56 mutations included Phe and lie subslitutions. 
The Phe 15 and Phe 1 56 mutations had no effect on either 
bioactivity or rHuEPO folding, as evidenced by a normal 
immunoreactivity measured by RIA-9G8A. However, an 
He 156 substitution affected both RIA-9G8A immunoreactiv- 
ity (20-fold increase) and bioactivity (1 1 -fold decrease), and 
an He 15 substitution reduced bioactivity over 200-fold while 
at the same time having no apparent effect on RIA-9G8A. 



These results suggest that an aromatic amino acid is required 
at position 15 for bioactivity, and that an aromatic amino 
acid at position 1 56 may be required for proper folding. 

Chemical modification of amino groups (eg, Lys) has also 
been reported to affect bioactivity."^* In addition, electro- 
static interactions between charged residues in four-helix- 
bundle proteins and their receptors have been reported to 
play a role in hormone receptor recognition."'* We therefore 
examined the effect of substitutions in the charged residues 
in rHuEPO. rHuEPO has six Asp, 12 GIu, eight Lys, 13 
Arg, and two His residues.''^ All the charged residues except 
ArgI62 were individually mutated to uncharged and/or op- 
positely charged residues. The Asp at 165 was removed by 
substituting a temiination codon at Thrl63 (Fig 1). Some 
substitutions of the acidic residue? ?.:Toctcd folding. How- 
ever, only a Glu62 to Thr mutation resulted in a decrease in 
bioactivity twofold. This variant also had a 7.5-foId increase 
in RIA-9G8A immunoreactivity, and an Ala62 mutation had 
no effect on bioactivity. This suggests that Glu62 may be 
important for structure but not for bioactivity, and that acidic 
residues as a group do not have a major role to play in 
bioactivity. The two His residues at 32 and 94 were also 
mutated. Mutations in these residues also had no effect on 
bioactivity. The results were different when the lysine and 
arginine residues were mutated. Mutations at four of eight 
lysine residues (positions 20, 45, 97, and 152) and five of 
13 arginine residues (positions 10. 14, 103, 143, and 150) 
affected bioactivity. Some of the mutations that affected bio- 
activity also affected folding (eg, at Lys97, Lys 152, Arg 10, 
and Arg 143). Therefore, we cannot eliminate the possibility 
that for these latter four residues the reduced bioactivity was 
due to indirect effects. Mutations at Arg 14, Arg 103, Arg 150, 
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Rg 2. Effect of mutattons on 
structure and function for resi* 
dues 5 to 23. A helical wheel pre- 
sentation shows amino acids 
found in rHuEPO (closest to the 
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tions (in boxes). R1A-9G8A im- 
munoreactivity for each variant 
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cent to the mutation. Shading in- 
dicates the effect of the muta- 
tion on in vitro bioactivity. 
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and Lys45 had dramatic effects on bioactivity with no appar- 
ent effect on folding. This suggests that these are part of 
the active site. Substitutions at Lys20 revealed additional 
information about charge requirements. An isoleucinc substi- 
tution at Lys20 had no effect on bioactivity; however, a 
glutamic acid substitution reduced bioactivity approximately 
1 1 -fold. Receptor binding activity was also reduced eightfold 
for the Glu20 mutation (data not shown). None of these 
mutations significantly affected RIA-9G8A immunoreactiv- 
ity. This suggests that there is not a requirement for a posi- 
tively charged residue at this position, but that a negatively 
charged residue disrupts receptor binding and thus bioactiv- 
ity. This observation in combination with the fact that only 
mutations in the positively charged lysine and arginine resi- 
dues effected bionctivity suggests that the electrostatic inter- 
actions between rHuEPO and its receptor may be modulated 
primarily by positive charges on rHuEPO and presumably 
by negative charges on the EPO receptor. 

Identification of active site mutants. With an analysis of 
this type, one would expect many of the mutations to result 
in a reduction in biologic activity because the rHuEPO struc- 
ture was affected. For example, disruption of disulfide bonds 
has been shown to alter rHuEPO structure and activity, and 
thus they presumably are not part of the active site.*' "-^- 
Some of these latter types of mutations were identified ac- 
cording to high immunoreactivity as measured by RIA- 
9G8A. For example, at position 126, five different substitu- 
tions were evaluated. Only an Ala 126 mutation affected 
RIA-9G8A immunoreactivity, and this was the only muta- 
tion that affected biologic activity even though the other 
substitutions were nonconservative. Thus, il is unlikely that 
SerI26 is part of the active site. 



Some of the regions in which mutations affect biologic 
activity are in potential or-helices.-" Mutations that disrupt 
protein structure as evidenced by increased RIA-9G8A im- 
munoreactivity are concentrated on the buried sides of the 
predicted helices. The predicted helices have been desig- 
nated helix A (Fig 2), helix B (Fig 3), helix C (Fig 4), and 
helix D (Fig 5). Thus, it is possible to discriminate between 
direct and indirect effects of the mutations on activity by 
examining the locations of the affected residues on helical 
wheels. 

Mutations in three positions in helix A (Fig 2) had large 
effects on bioactivity and were concentrated on the same 
side of the helix: Vall 1, Argl4, and Tyrl5. Gin 1 4 and He 15 
mutations did not affect immunoreactivity measured by RIA- 
')GSA but reduced the bioncti\'ity over 50- fold, which sug- 
gests that they may be folded nonnally. Therefore, these 
residues are probably part of the active site. The Serl 1 muta- 
tion also affected bioactivity, and it is on the same face of 
the predicted helix as Argl4 and Tyrl5- However, the muta- 
tion had a fourfold increased immunoreactivity measured by 
RIA-9G8A, suggesting an effect on folding. Mutations in 
residues Arg6, ArglO, and Leu 12 had modest effects on 
bioactivity. Mutations in these residues had an effect on 
rHuEPO folding as evidenced by their increased R1A-9G8A 
immunoreactivity, and they are on the same face of the helix 
as the bulk of the hydrophobic amino acids (eg. Leu 12, 
Leu 16, and Leu 17). Therefore, these mutations may be in 
residues that are buried. The Cys7 to Ser mutation had an 
effect on biologic activity, but it also had a dramatic effect 
on RIA-9G8A immunoreactivity, presumably because it is 
involved in formation of a Cys7-Cysl61 disulfide bond that 
is important for rHuEPO structure. ^ 



Fig 3. Effect of mutations on 
structure and function for resi- 
dues 62 to 79. A helical wheel 
presentation shows amino acids 
found in rHuEPO (closest to the 
circle). Arrows point to muta- 
tions (in boxes). RIA-9G8A Im- 
munoreactivity for each variant 
is shown in parentheses adja- 
cent to the mutation. Shading in- 
dicates the effect of the muta- 
tion on in vitro bioactivfty. 
Predicted buried and exposed 
surfaces are Indicated. 
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Fig 4. Effect of mutations on 
structure and function for resi- 
dues 94 to 111. A helical wheel 
presentation shows amino acids 
found in rHuEPO (closest to the 
circle). Arrows point to muta- 
tions (in boxes). RtA-9G8A im- 
munoreactivfty for each variant 
IS shown in parentheses adja- 
cent to the mutation. Shading in- 
dicates the effect of the muta- 
tion on in vitro bioactivity. 
Predicted buried and exposed 
surfaces are indicated. 



Mutations at three positions in helix B affected bioactivity: 
G!u62, Leu67, and Leu70. However, mutations in all three 
positions affected RIA-9G8A immunoreactivity and are pre- 
dicted to be in buried positions. This suggests that mutations 
in helix B that affect bioactivity do so indirectly (Fig 3). 

SerlOO, Argl03, Serl04, and LeulOS (helix C, Fig 4) are 
predicted to be on the protein surface, and mutations in 
them appear to be folded normally but dramatically affect 
bioactivity. Four substitutions were examined at SerlOO, in- 
cluding Glu, Thr, Arg, and Ala. Only the Ala 100 mutation 
had normal bioactivity. Mutations in residues on the opposite 
face of the predicted helix, at Leu 102 and Leu 1 05. severely 
affect folding as evidenced by high RIA-9G8A. Thus, it is 
unlikely that Leu 102 or LeulOS are part of the active site. On 
the predicted exposed surface of helix D (Fig 5), mutations 
at AsnI47, Arg 150, and Glyl51 had the greatest effect on 
bioactivity. An Arg 143 to Ala mutation had a modest effect 
on bioactivity, but this mutation also affected folding. A 
Leu 1 55 to Ala mutation also affected bioactivity. This resi- 
due may be partially buried or may be important for struc- 
ture, since mutations in it had an effect on folding. An 
Alal51 substitution resulted in greater than a 150-fold de- 
crease in bioactivity. However, the AlalSl mutation affected 
folding significantly. Several mutations in predicted buried 
positions affected RIA-9G8A immunoreactivity and bioac- 
tivity: Phel42, Leul49. Lysl52, Leul53. and Tyri56. 

Mutations between residues 42 and 5 1 also affected bioac- 
tivity. This region is thought to be in a connecting loop 
between helices A and B. An Asp45 mutation resulted in 
reduced bioactivity but appeared to be folded normally, sug- 



gesting that it may play a role in bioactivity (Table 1). The 
substitutions Asn42, lle44, Ala46, Asn5I, and Phe51 af- 
fected bioactivity also. However, the mutations also affected 
folding, so we cannot definitively conclude which ones are 
part of the active site. 

Receptor binding activity and biologic activity of rHuEPO 
variants reveal two classes. The residues thought to play 
a role in in vitro bioactivity were examined to see what 
effect they had on binding to the EPO receptor. Several 
rHuEPO variants were expressed in both COS-1 and CHO 
cells (Table 1). All the mutants that affected bioactivity in 
the A/B connecting loop are shown, even though we do not 
know which of them may be directly involved in biologic 
activity. The helix A, C. and D variants shown include only 
those with mutations in residues thought to be on the solvent- 
exposed surface. We purified three different rHuEPO vari- 
ants and natural-sequence rHuEPO to confirm the methods 
used to calculate activities. This was necessary because we 
determined the rHuEPO concentration in conditioned me- 
dium by immunoassay. The results are shown in Table 2. In 
all cases, results on purified material compared favorably 
with results obtained with unpurified material. This was true 
for effects on both biologic activity and receptor binding 
activity. 

Some of the mutations had modest effects on receptor 
binding activity but no detectable biologic activity (Tables 
! and 2). Similar results were observed for both COS and 
CHO cell-expressed rHuEPO variants. This result suggests 
that rHuEPO binding by itself is not sufficient for activation 
of the EPO receptor. The four regions important for bioactiv- 
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HELIX D 



Fig 5. Effect of mutations on 
structure and function for resi- 
dues 140 to 157. A helical wheel 
presentation shows amino acids 
found in rHuEPO (closest to the 
circle). Arrows point to muta- 
tions (in boxes). RIA-9G8A im- 
mu no reactivity for each variant 
is shown in parentheses adja- 
cent to the mutation. Shading in- 
dicates the effect of the muta- 
tion on in vitro bioactivity. 
Predicted buried and exposed 
surfaces are indicated. 




( ) 9G8A Immunoreactlvity % 
ncXXl In vitro activity >70% 
In vitro activity 20-70% 
In vitro activity <20% 
In vitro activity <2% 



ity could be divided into two groups according to the differ- 
ential effects on biologic activity and receptor binding activ- 
ity. The variants with changes in helices A and C retained 
substantial receptor binding activity (s=5% of normal) even 
though bioactivity in many cases was low or not detectable 
(Argl4/Gln, Tyrl5/Ile, SerlOO/Glu, SerlOOAFhr, Argl03/ 
Ala, Serl04/Ile. and Lysl08/Ala; Table 1). Thus, a relatively 
small change in receptor binding activity due to mutations 
in these regions resulted in much greater effects on bioactiv- 
ity. In contrast, substitutions in residues between 42 and 5 1 
and in residues in helix D had receptor binding activity that 
more closely correlated with biologic activity {Val46/Ala, 
Trp5I/Phe, Lysl47/Ala. and Argl50/Ala), Mutations in 
these two regions also differed from mutations in helices A 
and C in that some mutations resulted in dramatic decreases 
in receptor binding activity (a: 100-fold decrease; Lys45/ 
Asp, Asnl47/Lys, and Glyl51/Ala). One explanation for 
these observations is that the two groups may represent resi- 
dues in different receptor binding sites, and each receptor 
binding site has a different role to play in activity. 

DISCUSSION 

We have devised a strategy to evaluate the effect on fold- 
ing and bioactivity of 141 different rHuEPO mutations at 
1 1 8 positions, and found that four regions were important 
for bioactivity. These have been localized to helix A (Vail 1, 
Argl4, and TyrlS). helix C (SerlOO, Argl03, Serl04. and 
LeulOS), helix D (Asnl47, ArglSO. Gly!5l, and Leul55), 
and the A/B connecting loop (residues 42 to 5 1 ). 



Previous studies implicated three of four regions we iden- 
tified as having an important role to play in biological activ- 
ity. These regions included residues between 10 and 15, 100 
and 108. and 150 and 159.^**'" In particular, alanine-scanning 
mutagenesis data suggested that amino acids 14, 103, 104, 
105, 108, and 151 were important for bioactivity and that 
residues 100, 101, 102, 106, 107, and 109 were not.'' " 
We have also introduced alanines in all these positions and 
obtained qualitatively similar results. At three of these posi- 
tions, other substitutions were made. These substitutions, 
GlulOO/Ser, Serl04/Ile, and Leul08/Lys mutations, reduced 
bioactivity over 1,000-, 500-, and 5,000-fold, respectively, 
whereas Ala substitutions resulted in modest or no effects. 
Thus, at these positions Ala can substitute for the existing 
amino acid, resulting in effective activation of the EPO re- 
ceptor, but other amino acids cannot. Our results and those 
of Wen et al" and Grodberg^' are in disagreement with 
another study that reported that an Ala 103 mutation resulted 
in a modest effect on bioactivity.-*" An Ala 154 mutation was 
reported to reduce bioactivity lO-fold.'" We examined 
Argl54 and SerI54 substitutions and found no effect on 
bioactivity. In addition. Wen et al" tested an Ala 1 54 muta- 
tion and reported no effect on bioactivity. The same groups** 
reported that a Glul59 to Ala mutation reduced bioactivity 
fourfold. We tested a Serl59 mutation and found no effect 
on bioactivity. The reasons for the discrepancies are not 
known, but may be due to the different amino acid substitu- 
tions, the different methods of determining EPO concentra- 
tion, or the methods of expression and/or storage. 
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A recent analysis of hematopoietic cytokines and their 
receptors suggests that electrostatic interactions are im- 
portant for receptor binding. Complementar>' electrostatic 
interactions between the binding surfaces of growth hormone 
and lL-4 and their corresponding receptors have been sug- 
gested.^'' In addition, mutations in both positively charged 
and negatively charged residues in growth hormone affect 
activity."* We examined the effect of mutations in acidic and 
basic residues, and found that only mutations in the basic 
residues affected bioactivity of rHuEPO. This differs from 
what is suggested for growth hormone, which uses both 
positive and negative charges for interaction with the growth 
hormone receptor. However, the difference in the importance 
of negatively charged residues for rHuEPO and human 
grovvih hormone is consistent with the difference in the 
charge of growth hormone and rHuEPO; growth hormone 





Table 1. 


Activity of rHuEPO Variants 










Receptor 


In Vitro 








Expression* 


Bindingt 


Activitv* 


RIA-9G8A5 


Location 


Mutation 


(U/mL» 


(% of total) 


(% of total) 


(% of total) 




Human EPO 


67 


130 


187 


100 


Helix A 


Valll/Ser 


45 


25 


6.3 


380 




ArgU/Ala 


44 


40 


11 


590 




ArgU/Gln 


21 


28 


1.7 


82 




TyrlS/lle 




28 


<0.43 


150 


A/B Loop 


Pro42/Asn 


0.8 


<1 


<0.01 


1.600 




Thr44/Ile 


7 


52 


8.7 


640 




Lys45/Asp 


53 


0.2 


<0.03 


180 




VaU6/Ala 


9 


20 


7.8 


774 




Typ51/Phe 


97 


40 


34 


340 


Helix C 


SerlOO/Glu 


79 


8 


<0.01 


150 




SerlOO/Thr 


14 


11 


<0.2 


58 




Arg103/Ala 


7.7 


20 


<0.12 


86 




Sen 04/11 e 


30 


7.3 


<0.2 


100-240 




Ser104/Ala 


12 


36 


12 


150 




Leu108/Lvs 


35 


10 


<0.02 


330 


Helix D 


Asn147/Lys 


5 


1.2 


0.3 


260 




Arg150/Ala 


47 


19 


8 


97 




Gly151/Ala 


29 


<0.7 


<0.03 


760 




Leu155/Ala 


42 


48 


16 


439 



* The rHuEPO constructs were transfected into Cos cells. After 3 to 
5 days, the concentration (U/mL) of rHuEPO was immunoassay (see 
text). 

t Receptor binding assays were performed using OCIM-1 celts (hu- 
man erythroleukemia cells) and cell conditioned media from COS cells 
transfected with the wild-type or mutant EPO constructs. The assay 
measures displacement of rHuEPO. Specific activity was calcu- 
lated by dividing by the concentration determined by immunoassay. 
Results are expressed as a percentage of total. 

t In vitro activity was the ability of cell conditioned media con- 
taining wild-type or mutant EPO to stimulate 'H thymidine uptake in 
32D+EPOR (320 cells transfected with a murine EPO receptor). Spe- 
cific activity was calculated by dividing by the concentration deter- 
mined by immunoassay. Results are expressed as a percentage of 
total. 

5 9G8A assays were radioimmunoassays using the anti-EPO mono- 
clonal antibody 9G8A. Specific activity was calculated by dividing by 
corKentration determined by immunoassay. Results are expressed 
as a percentage of total. Increased immunoreactivity is indicative of 
altered folding. 



Table 2. Activity of Purified rHuEPO Variants 

Receptor Binding in Vitro Activity 
RIA 9G8A 



Mutation 


U/mg 


% of rHuEPO 


U/mg 


% oi rHuEPO 


(U/mg) 


rHuEPO 


175.000 


100 


179,000 


100 


144,000 


ArgU/GIn 


32,000 


18 


6,100 


3.4 


193,000 


Trp51/Phe 


40,200 


23 


54,000 


30 


454,000 


Asn147/Lys 


1,600 


0.92 


980 


0.55 


500.000 



Activity was determined as described in Table 1 and the Materials 
and Methods. Specific activity was calculated from activity (U/mL) by 
dividing by the protein concentration (determined from absorbance 
at A280). 



h:is a net chnrgc of -4c, and the prc:cin portion ofrHiiEPO 
has a net charge of -fSe. EPO is more like IL-4, which has 
a net charge of +7e. In this case, the charge in the IL-4 
receptor at the predicted IL-4 receptor-hormone interface is 
primarily negative.** This suggests that the positively 
charged (basic) residues on rHuEPO may interact with the 
negatively charged (acidic) residues on the EPO receptor. 

We and others have repoiled models of rHuEPO structure 
based on the four-helix-bund!e motif.-"-'-^^ The four re- 
gions important for bioactivity identified by ourselves and 
others^"'^' map to two sites on the rHuEPO models. These 
include site I, containing the residues between amino acids 
41 and 52 and amino acids 147, 150, and 151. The second 
site includes residues H, 14, 15, 100, 103, 104, and 108. 
Mutations in site 1 have substantial effects on both receptor 
binding and in vitro biological activity. Mutations in site 2 
have modest effects on receptor binding activity and much 
greater effects on in vitro biological activity. 

This pattern of active site residues is analogous to that of 
growth hormone, which has two binding sites, exists as a 
monomer in solution, and also has a four-helix-bundle struc- 
tural motifs 

Mutations in the two different sites in rHuEPO and human 
growth hormone behave similarly with respect to receptor 
binding activity and biologic activity. The observations can 
be explained by a sequential binding model, whereby the 
residues in site 1 have high affinity and are responsible for 
the initial interaction with the human growth hormone recep- 
tor. The residues in site 2 have less effect on receptor bind- 
ing, but are required for dimerization. One consequence of 
the human growth hormone site 2 mutations is that they can 
act as antagonists/^ We have found that mutations in site 2 
in rHuEPO can also act as antagonists (data not shown). 
Also consistent with this proposal is the recent report that 
the extracellular domain of the EPO receptor and rHuEPO 
can form a 2: 1 complex in solution.'"' In addition, both high- 
and low-affinity binding can be detected. Thus, it appears 
thai EPO receptor dimerization may proceed in a manner 
similar to that of growth hormone. 
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Efficiency of signalling through 
cytoicine receptors 
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receptor orientation 
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Human erythropoietin is a haematopoietic cytokine required for 
the dififcrentiation and proliferation of precurM)r cells into red 
blood cells'. It activates ceUs by binding and orientating two ceU- 
surfacc erythropoietin receptors (EPORs) vdiich trigger an intra- 
ceDuJar phosphorylation cascade^. The half-maximal response in 
a ccUular proliferation assay is evoked at an erythropoietin 
concentration of lOpM (ref. 3), 10*' of its X;, value for erythro- 
poietin-EPOR binding site 1 (K, - 1 nM), and 10" of the Xa for 
erylhropoietin-EPOR binding site 2 (X^ « 1 OveraU half- 
maximal binding (IC50) of cell-surface receptors is produced with 
-0.18 nM erythropoietin, indicating that only -6% of the rcc^ 
tors would be bound in the presence of 10 pM erythropoietin. 
Other effective erythropoietin-mimctic ligands that dimerixe 
receptors can evoke the same ccDular responses*^ but much less 
efiBciently, requiring concentrations dose to their values 
(-0.1 jiM). The crystal structure of erythropoietin complexed 

Table 1 Statls to for data conection and rtnicture refinements 

Parameters 



to the extracellular ligand-binding domains of ^.erythropoietin 
receptor* determined at 1.9 A from two crystal forms, shows that 
erythropoietin imposes a unique 120** angular relationship and 
orientation that b responsible for optimal signalling through 
LntraccUular kinase pathways. 

A soluble analogue of Escherichia coli- expressed erythropoietin 
(Swiss Prot accession no. P01588) was made by substituting lysine 
residues at three N-linkcd glycos>iation sites (N24K. N38K and 
N83K). The extracellular ligand-binding domain of the erythro- 
poietin receptor (EPObp; Swiss-Prot accession number F19235) 
was expressed in Pichia pastoris cells, with the N-linked glycos)da- 
tion site removed by an N52Q mutation. An N164Q muution was 
included to remove the potential for deamidation at the normal -N- 
G- sequence'. An A211E mutation was introduced for better 
expression and potentially enhanced folding efficiency on the 
basis of its role in improving the transportation of EPOR from 
the endoplasmic reticulum to the cell surface*. 

Eryihropoietin-(EPObp)2 complex cr>'Stals dif&acting to 2.8 A 
(fl = 73.3A. b=S0.5k c=134.9A, P2,2,2,; Form 1) were 
obtained and the structure was determined by single isomorphous 
replacement (SIR) based on a mercury derivative, foUowcd by 
solvent density modification by the methods of Abrahams' b the 
SOLOMON algorithm (Table 1). This was alternated with density 
avera^g of equivalent pairs of individual receptor domains. 
Higher-resolution data (1.9 A) were obtained from a new crystal 
form, which used an erythropoietin analogue with two additional 
mutations, P121N and P122S, introduced into the CD loop. These 
changes were suggested on the basis of NMR relaxation data, 
which showed exuemely low-order parameters for the rcsidiics 
(Glu 1 17-Ala 128) in this loop with conformational heterogeneity 
in the backbone in the vidnity of the proline residues'". This 
analogue of erythropoietin, when complexed with EPObp, cryst^- 
lized in a new unit ccU (a = 58.4 A. b = 79.3A.^c= 136.5 A, 
P2,2,2i; Form 2) with a diffraction limit beyond 1.9 A. The crystal 
structure of the erythropoictin-(EPObp)2 complex (Form 2) was 
solved by multiple isomorphous replacement with anomalous 
scattering (MIRAS) using one uranium and two platinum deriva- 
tives (Table 1). -ucni-iU 

The structure of human erythropoietin complexed with EPUbp 



Data ooOectipn 
Re$olut»on(A) 
Total ot>servaik)ns 
Unique reftections 
AMef89e//« 
Comp(etenesst%) 



Native 
(Forml) 

88.794 
20,677 

ae 

916(892) 
122(37.4) 



HqACi 
(Fofml) 

2.8 
t062^ 
19^ 
93 
88i>(86.3) 
9jB(3&2) 



Native 
(Form 2) 

19 
261681 
49.480 
19.1 (2.6) 
97X>(82^) 
4.4(37.7) 



(Form 2) 

2.3 
262332 
28338 
1^0(3.9) 
99.7(97.6) 
7JB(46.9) 



(Fonn 2) 



2.6 
167.497 
22.689 
1i7(5i>) 
99.9 (99J8) 
92(46J9) 



KjPttNQz), 
(Form 2) 

2.6 
137.462 
21321 
1Z4(3.7) 
93.4 (83JB) 
7.7C44JB) 



Phase determination 
Soaking oonc. (mM) 
I Soaklogtime(h) 
Number of sites 

i «i«,(*fcyt 
1 «.(*)» 

I Phasing povvet) 



land 10 
16 

2 

73 
643 
16/10 



0.1 
10 

2 

7,4 
66.0/43.0 
16/16 



0.1 
11 
2 

- \Z.7 
62.0/44.0 
17/15 



4 

24 

2 
12.0 
660/44i) 
13/14 



Retinemem 

ResoKitk>n(A) 

Reflection5.F>2« 

«a^/«*«SW 

Na water mote 
RjiLixL bond length (A) 
RjiLi.dbpndans^(deg) 
S.fectorcA') 



80X>*2.8 
18.797 
19J0/33.0 
0 
0,008 
139 
STjO 



&0J0-19 
44.607 
242/318 
300 
OJ006 
U 
343 
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,hows .ha, one molecule of erythropoietin ^^"''^.'^^JXZ l/c 
bolh crystal forms, the anangemcnt of molecules .s s'"^''" 
p^^ctTn. although the pacldng of molecules .s closer a^^^^^^^ 

in the a direction in Form 2. The most s.gn.ficam <|'ff"^"<f " ''^ 
positioning of .he D2 domains relative .o each « J- 
ihere are no receptor-receptor contact, whereas the "^o" '^*«'y 
packed Fom. 2 structure has a hydrogen-bonding -n.act bej.een 
Se side chains of residues Ser 135 and Glu 134 on adjacent EPObp 
D2 domains. This is in sharp contrast to the analogous structure 
with human growth hormone (hGH) bound to its receptor 
(hGHbp). in which .he D2 domains of the two receptor "'o'^^'" 
interact extensively with a buried surface area of over 900 A 
(ref 1 1) The Form 1 and Form 2 structures provide mdependen. 
[orrobora.ion of key features of .he ^<>'-P'^^- .^^^'^^^"''^^ 
overall s.ruCure from boU, crys.al forms .s very «rnJar with respe J 
I the interfaces bet>veen erythropoietin and EPObp. TTie angle 
between the adjacent EPObp Dl domains is (he same. supporUng 
the conclusion that Uie in.erfaces in .he crys.al s.ruc.ures represent 
those achieved by the complex at Ute ceU surface. 

Erythropoietin has an up-up-down-down four-helicJ bun^e 
to^ogy" with interheUcal angles similar to those of the l?ng- cham 
dS. fSexample hGH and gnmul.ocyte ~lony-sttmulatin«fac.or^ 
H^ever. i, alsi contains two small antiparaUel p-strands typ cal of 
theshort-chain class'\for example macrophage colony-stxm^^^ 

factor. stem-ceU factor, interleukin-4 and mterleuk,n-5. One pa^ of 
antiparalldlonghelices.«A(residues8-26)andaD(r«.duesl38- 

161) is held together by a disulphide bridge. Cys7 to Cys 61 The 
iA;pa^.aB{Lidues55-83)andaC{residucs90-112^shnked 

Srashortloop(FiE.l).TT.eaDhelix is sUghdy irregular b^l^^^^^^ 
a smaU kink at Gly 151. The short segments of ammo aads ftom the 



EPO 



EPObp2 



EPObp\ 




B1I39-411 82(133.135). DisulpNde MdQes: CysT^CyslCV 0^29^33. D^ 
Wl fiEI7fr«i eft^m. pG(106-t13>. OtoUM*te bfWges: Cys2a-CVB38. 

' l«m-163» «> W-a* The lo«-on. «l 



long AB and CD crossover loops in.eraclr*«ith each other to forman 
antiparallel P-sheet: pi (residues 39-41) and P2 (residues 133- 
135). Several aromatic and hydrophobic residues, such as Phe 138, 
Phe 142. Tyr 145. Phe 148, Leu 153 and Tyr 156, on the interior face 
of .he D-helix, pack against the non-polar side chains from the A, B 
and C helices to form the hydrophobic core of erythropoie.in. A 
second disulphide bond. Cys 29 to Cys 33. links the end of the aA 
helix vsath part of the AB loop. Erythropoie.in has two additional 
short helices, the oB' helix (residues 47-52) orthogonal to aB and 
the mini-helix aC (residues 114-121) following aCwiOi a 90' .ih 
beginning at Gly 11 3. . , , , 

The structure of EPObp is composed of a short ammo-.ermmal 
helix (~ 15 residues) followed by two P-sandwich domains, Dl (N- 
tcrminal) and D2 (carboxy-terminal), each consisting of -100 
residues. The overall fold of EPObp is as described previously for 
die crystal structure of the erythropoietin receptor (EBP) com- 
plexed to a peptide agonist (EMPl)*. The N-terminal helix (residues 
9-22) of both EPObp structures is tucked into the elbow formed by , 
the Dl and D2 domains (Figs 1 and 2a), in close proximity to .he 
WSXWS motif (residues 209-213; X = Glu) in D2. The side chain 
of helbt residue Uu 18 makes hydrophobic contacts (<4.5 A) with 
Phe 29. Leu 27. Leu 120 and Phe 208 of EPObp. Alteration of the 
WSXWS sequence disruptt erythropoietin binding and recq>tor 
siRnalling", correlating with similar findings for the inteTleukin-2. 
prolactin and hGH receptors". The WSXWS motif has been shown 
to be critical for the folding and transport of the receptor to the cell 
surface, and an A21 IE mutation further improved the efficiency of 
the processes'. In the erythropoietin-(EPObp)i complex, the side 
chain of Glu211 f^om the WSXWS motif is closest (<43A) to 
Leu 17 of the N-terminal receptor helix. Additionally, the side 
chains of residues Trp209 and Trp212 sandwch the hydrophobic 
side chain of Argl97 in the receptor fold, whereas Ser 210 and 
Ser213 are within hydrogen-bonding distance of Ala 198 and 
Val 196. respectivdy. On the baas of the interactions of the 
WSXWS box with the N-temrinal helix and the ^-sheet residties 
Val 196. Aigl97 and Ala 198 in the «rythropoietin-(EPObp), 
complex, we suggest that the N-tenninal hdix is bioJopoDy 
important in stabilizing the folded EPOR.niis helix position differs 
from the N-tenninal beUces seen in the two-fold symmetnc EMPl- 
EBP complex, in whidi the connection between hdix and Dl was 
not visible and unconnected heUces were farther away from the Dl/ 

02 elbow (Fig. 2b)*. , , 

In the erythropoietin-(EPObp), complex, the receptor mol- 
ecules are held together through two regions »ocat~ <»•> opP«»«< 
feces of erythropoietin (Rgs 1 and 2a and Tabk 2). •n*^"!^ 
Doietin-EPObp intafeces haw been identified as high-affimty 
(X. - 1 nM) and low affinity (K^ - I |iM) and are referred to as 

she 1 and site 2 respectivdy*. Site 1 is composed of enrjhiopoictin 
residues fiom hdices oA. oB', oD and part of the AB loop. Site 2 

erythrop««etin readues are located m oA and oC Com^f»«»*^ 
receptor intcifeces invoWe contributions from six EP<*P ]<»^ 
(LI -L6) between P-strands from both domains: U, U and L3 lUl 
domain): U from the polypeptide pi-D2 »^ " 

(D2 domain). Residues from aD six loops are inwivtd m Ae site 1 

inter&ce and dl except the fourth »«>P P^^^^^^Sl h^^^ 
Site 1 fa diaracteriied by a caitnd hyditqiliobic bmdmg po^ 

ftaAed at opposite ends by hydrophiBe fataacdons CJbk 2) 
EPObp loops LI, L5 and L6 acoommodaU the hydrophobic ery- 
,hropoi«dn>esldu««,with Pbe95 <>f 

poto interactioos. This pivotal raidiie b finnjr Md m ^ ^ 
gSrogm bonding to eiythiopoietin 
3^oftbephaMring«rfl1«»I»d»«-^^ 
(at i^ance of 3JI A) and fa aewmnodbted at a MjgJ** 
«„fi«c«atedbr.q«.itetofeiythrof«fclfai«aed^(^ 

11.e4« andl« 155). MBtageo«Ate Ao«n^^ 
the Kcqtor fa a crilial deteimhiant of wrlhrogieg^^ 
P93A moution cfiminales any detectable Wnfing^- »« 



NATUREIVOl 3951 1 OCTOBER I998l» 



w.naturcCD' 



letters to nature 



interface also has contributions from the A-B linker polypepfadc 
residues (Thr 44-Phc 48) of erythropoietin. A smaller contribution 
is made by Met 1 50 of EPObp. which is within van dcr Waals contact 
distance of only a single amino add. Phe 48, of erythropoietin. The 
hydrophobic binding surface (site 1) itself is surrounded by hydro- 
phiHc interactions (Table 2). Asn 147 of erythropoietin is central to 
the hydrophilic D helix intcracUons. It has three hydrogen bonds to 
EPObp residues, one of which is between its N8 atom and the 
carbonyi oxygen of Phe 93. On cither side of the Asn 147 are 
aipninc residues. Arg 143 and Arg 150. that interact with glutamic 
adds GIu60 (U) and Glull7 (L4). respectively. Flanking the 
binding site at one end, Ser9 and Arg 10 from the A helix of 
erythropoietin interact with GIul76 and His 153, located at the 
end of pC and in the L5 loop of the receptor, respectively. At the 
opposite end. Arg 143 and Lys 140 of aD, and Lys 45 of the AB loop 
of erythropoietin, interact with residues Glu 60, Asp 61 and Glu 62 
of the L2 loop of the EPObp Dl domain. 



At site 2. the interactions are less extensive than at sPf^l. Most of 
the interactions are between residues of the C helix of erythropoietin 
and the L3 loop of EPObp. The hydrophobic surface is created on 
EPObp by residues Phe 93. Phe 205 and Met 1 50. although because 
of the positioning of the Dl domain relative to the interaction of site 
2 with erythropoietin there is a relatively flat surface that erythro- 
poietin residues can interact with. The closest non-polar contact is 
between erythropoietin Uu 108 and EPObp Phe 93 07(3.9 A). The 
rest of the C helix residues are positioned at a greater distance from 
the EPObp hydrophobic surface than at the site 1 interface. Phe 93 is 
again the major contributor to the central hydrophobic binding 
pocket (Leu 5. Valll. Tyrl5. Ser 104. Thr 107 and Leu 108 of 
erythropoietin are within 4.5 A of EPObp Phe93). Met 150 of 
EPObp. however, is more buried than in site 1. making van der 
Waals contacts with Arg 10, Val 11 and Arg 14 of erythropoietin. A 
total of 12 hydrophilic interactions are formed between residues in 
the A and C helices of erythropoietin (Asp 8, Arg 1 4, Lys 97, Ser 1 00, 




ngur* t Stefw> vlww. eiythfopoWfHEPObpJ^ t. (Efc^l-EBPIb. C hGH- 
(hOHfapk. Al the woepttw ere coloured blu©, w»i Iheir HQantfs In red. The 
fcitoriwtocutof cortactB (<4^A) IrwoM^ 
yelow: IQWJ omto acids ¥^ tf» reoep» on the 

erytfiropotethi ewJ h6H) ere shOMim In 



receptor on ihe IcMwnd side (si* 2 tor eiythfopol^ 

wNte. Receptor orfemartorw are also rcpfcserted echernatk^ 

the hofteonm membrane plane shown In grey w6 the oo<respon<fifHI vertfeal 

plarte draMn in • chequered pettra 



513 



letters to nature 



Site t atoms 



Sfie 2 atoms 



EPO 



Lys20 Ni 
Thf44 Oy 
Lys46 NC 
Val46N 
Val4€0 
Asn 47 Nh2 
Asn 47 N&2 
Arg 131 Nn2 
Lys 140 Ni 
Arg 143 Nt>2 
Asn 147 Nh2 
Asn 147 N&2 
Asn 147 0&1 
Arg 150 O 
Arg 150 NtjI 
Arg 150 Nt)! 
A/g 150 Nn2 



EPObp 

Gtu202O<l 
Phe93 0 
Glu62 0€i 
Ser92 0 
Ser92N 
Thf 87 O 
Ala 680 
Asp61 O 
Asp6l 0&2 
G»u60Oc2 
Pbe93 0 
His 114 Nt2 
His 114 Nil 
Ser204 O7 
Pro 203 0 
Glu 117 Ot2 
Glun7 0*2 



Distance (A) 

3.5 
2.6 
3.1 
3.0 
2.8 
2.9 
3.2 
2.8 
3.0 
2.8 
3.1 
3.0 
3.5 
2.6 
2.8 
3.2 
2.8 



EPO 

Asp8062 
Arg 14 N-r>2 
Lys 97 N{ 
Sef 100 0 
Arg 103 N* 
Arg 103 N* 
Arg 103 N^l 
Arg 103 th\\ 
Arg 103 N'r»2 
A/g 103 
Ser 104 O^ 



EPOt>p 


Disiance (A) 




HislS3r4c2 


2.9 


Leu 33 0 


3.1 


Glu 34 0*1 


2.7 


Ser 91 Or 


2.9 


Glu 62 0*1 


3.1 


Glu 62 0*2 


3.5 


Ser 91 Ot 


2.7 


Ala 88 0 


2.6 


Ala 88 0 


3.3 


AspeSO&l 


3.1 


Ser 92 0 


2.9 



Are 103 and Ser 104) and loops LI, L2, L3 and L5 of EPObp (Leu 33, 
Glu 34, Glu 62, Ala 88. Asp 89, Scr91, Scr92 and Hisl53). w,lh a 
total of 1 1 hydrogen bonds (Tabic 2). The side chains of helix A 
residues Asp 8 and Arg 14 form hydrogen bonds with the side chain 
of His 153 and the carbonyl oxygen of Leu 33. respectively. 

In general, the sidechain interactions at both interfaces arc 
predominantly between positively charged lysine and arginine 
residues of erythropoietin and negatively charged aspartate and 
glutamate side chains of EPObp. Site 1 contains almost twice as 
many sidechain-sidechain interactions as site 2. However, the 
numbers of hydrophiUc contacU involving main-chain atoms 
from cither erythropoietin or EPObp are equal for both sites. In 
addition, site 1 contains two hydrogen bonds that involve main- 
chain atoms from both erythropoietin and EPObp. Hydrophflic 
interactions f^om three EPObp residues (Glu 62, Ala 88 and Ser 92) 
are common to both interfaces, two of which invoWe hydrogen 
bonding from a main-chain atom (Tabic 2). Although most residues 
that interact across both interfaces are polar (Table 3). the expected 
contribution to binding affinity is greatest from the hydrophobic 

contacts. . . ^ 

Although the crystal structure of erythropoietm-(EPUbph 
clearly shows an extensive interface between erythropoietin and 
its receptor (buried surfece areas at sites 1 and 2 arc 920 and 660 A . 



respertively), a much smaller subset of residues, identified by 
mutagenesis, delineates a functional epitope (Table 3)**"^'. Gly 151 
has an important structural role in erythropoietin, forming a kink 
in the D helix that brings the side chain of Lys 152 into hydrophobic 
conuct with Val63, Trp51 and Phe 148 in the protein core. 
Replacement of alanine at either position 151 or 152 causes a 
substantial loss of bioactivity**"^'. The basic residues Lys 20 and 
Lys 45 on erythropoietin in the interface of site 1 are unaffected by 
Ala substitutions", but mutations to acidic residues do cause 
substantial loss of bioaaivit/'. Arg 103 of erythropoietin has 
been implicated in site 2 binding'*"" and from the structure we 
sec that it is completely buried at the interface, interacting with six 
residues on EPObp. Erythropoietin mutant R103A has been shown 
to have unchanged site 1 affinity, but to lack proliferative function 
owing to loss of site 2 binding*. A sequential binding mechanism 
was proposed on the basis of this analysts of several erythropoietin 
mutants. Arg 14 on erythropoietin, which interacts with four 
residues on EPObp, is also very sensitive to mutagenesis'^'*. 
Sedimentation €quihl)riura data have shown that the R14Q muU- 
tion in erythropoietin reduces the site 2 afi&nity roughly fivefold 
(J. Philo, personal communication). 

Comparison of the erythropoietin-(EPObp)2 interactions with 
those of hGH-(hGHbp)2 (ref. 11) reveals a number of significant 



Tab>» 

Sftel 



EPO 
reskkie 



Burted 
surface aree 



Neighbouring 
EPObp residues 
(4SA cutoff) 



Effect c4 EPO 
mutagenesis 
on WoactMlyt 



EPO 


Burled 


residue 


surface area 






UB 


G6 


De 


23 


Rio 


51 


VII 


49 


RM 


106 


Y16 


16 


Q78 


11 


096 


44 


K97 


36 


V»9 


7 


SKX) 


40 


R103 


118 


SKM 


38 


7107 


60 


Lice 


15 


RtIO 


26 



Neighbourir^ 
EPOt)p residues 
(4SA cutoff) 



Effect of EPO 
mutagenesis 
on bioectlviiyt 



RK) 
E13 
L16 
L17 
K20 
T44 
K46 
V46 
N47 
F48 
Y49 
ICS2 
R131 
1133 
K140 
RMS 
Nt47 
R160 
0161 
K164 
L156 



17 
24 
30 
22 
15 
48 
19 
107 
38 
67 
119 
61 
17 
34 
22 
31 
43 
43 
128 
M 
32 
26 
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differences. In hGH-(hGHbp)2, two different regions of the AB 
loop of hGH (the start of the loop at Gin 46 and the end of the loop 
at Arg64) are involved in site-1 ligand- receptor interactions. In 
contrast, only the C-ierminal end of the AB loop of erythropoietin, 
at residue Arg45. is involved in sitc-1 erythropoietin- EPObp 
binding. In the hGH-(hGHbp)2 complex, two hGHbp residues, 
Trp 104 and Trp 169, form a central hydrophobic pocket flanked by 
hydrcphilic interactions. In comparison, in the crythiopoietin- 
(EPObp)2 complex, Phe93, the direct homologue of Trp 104 in 
hGHbp, IS the primary contributor to the interactions between 
receptor and ligand in the hydrophobic pocket; Met 1 50 on EPObp 
(homologous to Trp 169 on hGHbp) is more peripherally involved. 
As a result of these differences, the binding between ligand and 
receptor mediated by the central hydrophobic pocket of the receptor 
seems to be more localized for erythropoietin- (EPObp)2 than for 
hGH-(hGHbp)2. This region has been identified as a mutational 
hot spot in hGH-(hGHbp)j (ref. 22). 

Comparison of the er)nhropoictLn-(EPObp)2, (EMPl-EBP)^ 
and hGH-(hGHbp)2 structures reveals that the receptor molecules 
have significantly different orientations relative to each other 
(Fig. 2a-c). Viewed perpendicular to the membrane plane, the Dl 
domains from the two receptor molecules in the crythropoietin- 
(EPObp)2 complex are positioned ---120* relative to each other, 
compared with 180* in (EMPl-EBP):. For the hGH-(hGHbp)2 
complexes, the angle is 160®. The two D2 domains in the erythro- 
poietin- (EPObp)2 and hGH-(hGHbp)2 complexes are in approxi- 
mately the same plane, perpendicular to the membrane. In contrast, 
the two D2 domains in the (EMPl -EBP)2 complex are twisted away 
from each other by ^45". 

Disulphide. crossUnking between murine erythropoietin recep- 
tors, generated by the replacement of Arg 130 with cysteine, resulted 
in constitutive activation". The two EPObp Arg 130 residues, 
located near the membrane-proximal region of the D2 domains 
in the two adjacent receptors, are 27 A apart. Formation of a 
disulphide bond would therefore require significant angular rear- 
rangemciit of the receptor domains. Signalling in this case might be 
the result of a subset of the receptor molecules dimerizing intra- 
ccUtilariy before being transported to the ceQ sur^cc, as only a 
fraction of EPOR achieves constitutive activation". Alternatively, a 
constrained angular relationship in the mutant murine dimers 
could bring about an impaired signalling event while still allowing 
erythropoietin to bind to its receptors. 

The dSidcncy of signalling by erythropoietin mimetics is typically 
at most 1/20 that of the natural ligand\ Our results therefore show 
that the activating efficiency of EPOR* and by inference each of the 
cytokine receptor complexes, depends critically on the separation, 
orientation and relative disposition of bound receptors, and hence 
the effect on their tinique intraceOular domains. This consideration 
is therefore an important parameter in the design of erythropoietin 
fnimetics; specifically, it suggests that asymmetrical molecules 
anight be required to achieve these orientations most efficiently. □ 



Vot«ln preparation and crystalHzatlon. Recombinant human erythropoietin 
nutants, r-mct lyi bu aythropoktin N24K N58K N83K and r-mct lyi hu 
TythropoMn N24K N38K N83K P121N P122S, wtit produced in £ cd6 ceOs 
Old purified as described'*. Recombinant human EPObp (r-REF huEPOR 
"ISZQ N164Q A2I1E) %/as secreted from Pkhia pastoris^ and purified as 
lesoftwd {HJL a dL, manuscript in prqnntion). EPOI^ contains three N- 
cnnmal residues (Arg-Ou-Pbe) that rqilace Ab in dtc wfld-type scquenoe'^. 
loth crythrapoietin-CEPObp)] complocs were crystaOixcd by hangEng-drop 
apoor difittskm al.20*C Protcio was mixed with an equal vohune of the 
lyitaOaMm-wdl bufocootatmn^ 15% PEG 4K, 02H OOt and OilM 
4ES pH 63-7jO (Ikiffm 1) and m cryoprotcctam coo£tions of 52H PEG ISOO, 
'MUJWMSO^ MES, pH 63 (Fonn 2). Fofm-l data from three 
•oaMrpboos oystab (csOecled at 20X} omstitttcd the in^ 

were collected with an R-AXIS n ima^e pbte sfrtcin iDOunted on a Rigaka 



RU200 generator using a rotating copper anode target tube operating at 50 kV 
and 60 mA. These were autoindcxed, integrated, scaJcd and merged with the R- 
AXJS associated data- reduction software BIOTEX (MoIecuJar Structure Cor- 
poration). Forni-2 data were coUectcd with a single crystal at 1 OOK on an R- Axis 
FV imaging system installed on an 18 kW Rjgaku RU300 generator. Data were 
indexed, integrated and scaled with the programs DENZO arxJ SCALEPACK*. 
Structure determination and refinement. For Form I , two highly anisoUopic 
mercury sites from the single derivative (Table I) were refined with 
MLPHARE" and double-difTcrencc maps (mean figure of merit = 0.31). 
Density modification, with SOLOMON* and DM", yielded a map at 4 A 
resolution that outlined the a-helicc$ in erythropoietin and the pattern of p- 
sheets in the ordered chains of EPObp. 500 cycles of density modification 
followed as the resolution was extended from 4.0 to 2.8 A in 0.004-A steps. At 
each step the solvent mask was recalculated by using a radius equal to the 
resolution of the map. Structure factors calculated from the modified map were 
used to estimate the phases in the next resolution shell These were not 
combined with the SIR phase probabilities owing to the low phasing power 
beyond 4 A Further to resolution extension, the density for domains DI and 
DT, D2 and D2' were averaged to improve the phases iteratively by imposing 
consUaints of e<]uivalencc in the domains. The Dl and D2 domain boundaries 
for each receptor were identified by inspection and the rotation angles between 
sinular domains were determined by maximizing the correlation between the 
selected map regions. The initial structure solution gave an Kb« of 46%. The 
structiirc was refined with XPLOR3.85! (ref. 28) interieaved with density 
averaging between the Dl/Dl' and D2/D2' domains and calculation of omit 
maps. Several rounds of refinement and building to 'complete* omit and 
2F, - f, maps resulted in the final model {R^/Ri^ = 19.0%O3.0%). Towards 
the latter stages of refinement, the release of the (EMPI-EBP)2 coordinates 
(Brookhavcn Protein Data Bank (PDB) accession number lEBP) allowed the 
positioning of some surface exposed \oops in the EPObp rcceptOL 

For Form 2. heavy- atom sites were identified for each derivative in difference 
Patterson maps, refined for their positions and occupandes, and confirmed by 
examining the peaks in cross-phased difference Fourier maps'. The MIRAS 
phases calculated^ by using isomorphous and anomalous differences from the 
three derivatives (Table 1) %rith the native data (mean figure of merit 0.72, 
50.0-23 A) and soNent flattened with SIGMAA weighting (mean figure of 
merit 0,92, 50.0-23 A) produced a good-quality map. This map was used to 
buiW the complete model of erythropoietin- (EPObp)2 with the program 
package QUANrA97 (Molecular Simulations). The starting tempbtes for the 
Dl and D2 domains of EPObp used in model building were taken from the 
preliminary crystal structure of the (EMPl-EBP)^ comfJex (TJ>.0. ef at, 
manuscript in preparation) reproduced from previously published workt. The 
locations of cysteine residues were confirmed by examining an anonuJous map, 
which showed dear peaks for sulphur atoms contoured at 2.0tf . The stnicture 
vras refined with 705 reflections (39b) exduded for cross-validation (J^ by 
using simubted «nn<^i;ng accompanied by the bulk sohcnt correction proto- 
<xA in XPLOR3.851 {rtC 28). The model after nine cydcs of manual modd 
building, simulated annealing, and positional and temperature bctor rcfine- 
menu yielded an J^, and an l?fc« of 0.236 and 0339 (50.0-23 A. f> 2o,h 
rcq)ective)y. An additional three cydes of model buildxi^ refinement and 
phase combiiMtion at 1.9 A resolution with 1323 reflections (3%) excluded for 
cross-validation {F^ resuhcd in a crystaDographic RayJRt^ of 0J42/0318 
(50.0-1.9A,F>2ff,). 
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The mitochondrial outer mcnibrane contains madiiiicry for tlie 
Import of prepiotdns encoded by niicktf genes'"*- Eight different 
Tom (tianslocase of outer membrane) proteins have been Identi- 
fied that function as receptors and/or are rdated to a bypothetica] 
general import pore. Many mhodMMidria! membrane channel 
activities have been described^, incio^ one nUted to llm23 
of tlie inner-membrane proCcin4nqN>rt sfrtcm^ bovvcvcr» die 
poK4bnning sabinJt(s) of the Tom machinery have not been 
Identified mtil now. Here we desaribe tlie cspression and Innc- 
tfonal leconstitntioo of IbmiO^ an intcg;ral memlirane protein 

BuJnlf p-dftecl strnctaic. IbmM f(»^ 
l,ywCT,»^«fw» A««nrf Aat yrfficagy Mnds to and trans- 



ports mitochondrial- targeting sequences added to the cis side of 
the membrane. We conclude that Tom404s^the pore-forming subunit 
of the mitochondrial general import pore and that it constitutes a 
hydrophilic, —22 A wide channel for the import of preproteins. 

We expressed Sacchawmyccs ccrevisiae Tom40 (ref. 8) in 
Escherichia coli cells (Fig. la, lane 2), Tom40. representing -25% 
of total £ coli protein, accumulated in inclusion bodies and could 
be solubilizcd with urea. As Tom40 may be slightly similar to porins 
at the secondary structure leveP. we modified a method for the 
preparation and renaturation of Rhodopseudomonas blastica 
porin from £. coli inclusion bodies'**. Tom40 was isolated to high 
purity (>95%) (Fig. la, lane 3). The two protein bands of minor 
abundance that are visible in the purified preparation of Tom40 
(Fig. la, lane 3) are degradation products of Tom40. When diluted 
into the non-ionic detergent nonanoyl-N-methylglucamide (Mega- 
9), the recombinant Tom40 migrated as a single band on blue native 
electrophoresis and this band was indistinguishable from that of 
Tom40 that had been solubilized from mitochondria by Mega-9 
(Fig. lb). Circular dichroism analysis of Tom40 in urea and after 
dilution into Mega-9 indicated an efficient refolding of the protein 
upon dilution (Fig. Ic). We inserted Tom40 into liposomes by 
dilution of the urea-denatured protein into a mixtiuc of Mcga-9 
and azolectin, with subsequent removal of the detergent. The 
circular dichroism spectrum of liposome-inserted Tom40 was 
comparable to that of Tom40 in Mega-9 (Fig. Ic). A secondary 
structure calcxilation according to ref. 1 1 indicated a predominance 
of P-sheet structure (> 60%) and little a-helical structure (<5%) in 
the protein. 

To assess whether Tom40 was properly reconstituted in the 
liposomes, we determined its topologjcal orientation in liposomes 
in comparison to that of Tom40 in mitochondria. Treatment of 
mitochondria with trypsin cleaved a small peptide (relative mole- 
cular mass (M,) -2.5K) from Tom40 (Fig. Id, upper panel, lanes 2, 
3). The same cleavage occurred after recombinant Tom40 inserted 
into liposomes was treated with trypsin (Fig. Id, lower pand, lanes 
2, 3). The remaining fragment of Tom40 was protected by the 
membranes, as, after lysis of mitochondria or liposomes with Triton 
X-100, it was digested by trypsin (Rg. Id, lane 4). Antibodies 
directed against the amino-terminal or carboxy-tcrminal rcpons 
(12 terminal amino-acid residues in each r^on) of Tora40 showed 
that the C-terminal epitope of mitochondrial Tom40 was retained 
in the fragment (Fig. Id, upper pand, lanes 6, 7), whereas the 
N-tenninal epitope was removed by trypsin (Fig, Id, upper panel, 
lane 10). The same topology of Tom4a was seen in the proteo- 
liposomcs (Fig. Id, lower pand, lanes 6. 7, 10). A quantitative 
assessment of the fragmenUtion of Tom40 in mitochondria and ! 
liposomes (induding a consideration of the higher renstanoe of 
mitodiondrial membranes to trypan) indicates that at least 50- 
60% of the recombinant Tom40 was inserted into liposomes with 
the correct orientation (another fraction of recombinant Tom40 
was fuDy digested by trypsin even in the presence of intact liposome 
membranes, probably representing Tom40 associated with the 
liposome preparation but not inserted into the membrane). 
Indeed, a prcprotcin specificaily affected the diannd activity only 
when added to the ds side of the membrane (tlut is, to the N 
terminus of Tom40) (see bdow). We condude that the recombinant 
Tom40 molecules that were functionaDy incorporated mto 
liposomes were asynunetrically inserted with the same rdatWe 
orientation as in the outer mitodiondria] membrane. 

lb determine whether reconstituted IbmiO could form a chan- 
nel, we peiformcd dectropfayaological studies. After ftinon cf 
lbm40-containing liposomes with a planar Inlaycr, nngle-<hannd 
currents could indeed be measured (Rg. 2a). At membntne p oten* 
tiah bekw 100 mV, the channds vrcre mainly open and we €ft»crvcd 
brief; fliclEiaing closures of the channd. When examined at higher 

time resohitioo (lOkHx). die corrent reoorfings Aowed subooo- 
ductance Icveb (kwer trace. Kg. 2aX T1>e coitent tiacw dnm that 
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